(57) Abstract



A biosynthetic method for producing vitamin C (ascorbic acid, L-ascorbic acid, or AA) is disclosed. Such a method includes 
fermentation of a genetically modified microorganism or plant to produce L-ascorbic acid. In particular, the present invention relates 
to the use of microorganisms and plants having at least one genetic modification to increase the action of an enzyme involved in 
the ascorbic acid biosynthetic pathway. Included is the use of nucleotide sequences encoding epimerases, including the endogenous 
GDP-D-mannose:GDP-L-galactose epimerase from the L-ascorbic acid pathway and homologues thereof for the purposes of improving
the biosynthetic production of ascorbic acid. The present invention also relates to genetically modified microorganisms, such as strains of
microalgae, bacteria and yeast useful for producing L-ascorbic acid, and to genetically modified plants, useful for producing consumable
plant food products.

VITAMIN C PRODUCTION IN MICROORGANISMS AND PLANTS



FIELD OF THE INVENTION 
The present invention relates to vitamin C (L-ascorbic acid) production using
genetically modified microorganisms and plants. In particular, the present invention
relates to the use of nucleotide sugar epimerase enzymes for the biological production of
ascorbic acid in plants and microorganisms. 

BACKGROUND OF THE INVENTION 
Nearly all forms of life, both plant and animal, either synthesize ascorbic acid 
(vitamin C) or require it as a nutrient. Ascorbic acid was first identified as useful as a
dietary supplement for humans and animals for the prevention of scurvy. Ascorbic acid,
however, also affects human physiological functions such as the absorption of iron, cold
tolerance, the maintenance of the adrenal cortex, wound healing, the synthesis of
polysaccharides and collagen, the formation of cartilage, dentine, bone and teeth, and the
maintenance of capillaries, and is useful as an antioxidant.

For use as a dietary supplement, ascorbic acid can be isolated from natural
sources, such as rosehips, synthesized chemically through the oxidation of L-sorbose, or
produced by the oxidative fermentation of calcium D-gluconate by Acetobacter
suboxidans. Considine, "Ascorbic Acid," Van Nostrand's Scientific Encyclopedia, Vol.
1, pp. 237-238, (1989). Ascorbic acid (predominantly intracellular) has also been
obtained through the fermentation of strains of the microalga, Chlorella pyrenoidosa. See
U.S. Patent No. 5,001,059 by Skatrud, which is assigned to the assignee of the present
application. It is believed that ascorbic acid is produced inside the chloroplasts of
photosynthetic microorganisms and functions to neutralize energetic electrons produced
during photosynthesis. Accordingly, ascorbic acid production is known in photosynthetic
organisms as a protective mechanism.

Therefore, products and processes which improve the ability to biosynthetically 
produce ascorbic acid are desirable and beneficial for the improvement of human health. 

SUMMARY OF THE INVENTION 
One embodiment of the present invention relates to a method for producing
ascorbic acid or esters thereof in a microorganism. The method includes the steps of: (a)
culturing a microorganism having a genetic modification to increase the action of an
enzyme selected from the group of hexokinase, glucose phosphate isomerase,
phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase,
GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase,
L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone
dehydrogenase; and (b) recovering the ascorbic acid or esters produced by the
microorganism. Preferably, the genetic modification is a genetic modification to increase
the action of an enzyme selected from the group of GDP-D-mannose:GDP-L-galactose
epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose
dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase. In one embodiment of the
method of the present invention, the microorganism further includes a genetic
modification to decrease the action of an enzyme having GDP-D-mannose as a substrate,
other than GDP-D-mannose:GDP-L-galactose epimerase. Such a genetic modification
can include, for example, a genetic modification to decrease the action of
GDP-D-mannose-dehydrogenase.

In one embodiment, the genetic modification is a genetic modification to increase
the action of an epimerase that catalyzes conversion of GDP-D-mannose to
GDP-L-galactose, which can include GDP-D-mannose:GDP-L-galactose epimerase. In one
embodiment, the epimerase binds NADPH. In one embodiment of this method, the
genetic modification includes transformation of the microorganism with a recombinant
nucleic acid molecule that expresses the epimerase. Such an epimerase can have a tertiary
structure that substantially conforms to the tertiary structure of a
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having
Brookhaven Protein Data Bank Accession Code 1bws. Preferably, the epimerase has a structure
having an average root mean square deviation of less than about 2.5 Å, and more
preferably less than about 1 Å, over at least about 25% of Cα positions of the tertiary
structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by
atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.
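
The structural criterion above (a root mean square deviation below a cutoff over at least a stated fraction of Cα positions) can be illustrated with a minimal Python sketch. It assumes the two structures have already been superimposed and that matching Cα atoms have been paired by some alignment procedure; the function names `ca_rmsd` and `conforms`, and the parameter `reference_ca_count`, are hypothetical and are not part of the disclosure.

```python
import math

def ca_rmsd(coords_a, coords_b):
    """RMSD (in angstroms) between paired, already-superimposed C-alpha coordinates."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two equal-length, non-empty coordinate lists")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))

def conforms(coords_a, coords_b, reference_ca_count, max_rmsd=2.5, min_fraction=0.25):
    """True if the paired positions cover at least min_fraction of the reference
    C-alpha positions and their RMSD is below max_rmsd.

    Assumption: 'about' in the text is modelled as a hard cutoff for simplicity.
    """
    coverage = len(coords_a) / reference_ca_count
    return coverage >= min_fraction and ca_rmsd(coords_a, coords_b) < max_rmsd
```

Passing `max_rmsd=1.0` corresponds to the more preferred cutoff. In practice the superposition and residue pairing would come from a structure-alignment tool; this sketch covers only the final threshold check.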

In one embodiment, the epimerase comprises a substrate binding site having a
tertiary structure that substantially conforms to the tertiary structure of the substrate
binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by
atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. Such
a substrate binding site preferably has a tertiary structure with an average root mean
square deviation of less than about 2.5 Å over at least about 25% of Cα positions of the
tertiary structure of a substrate binding site of a GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data
Bank Accession Code 1bws.

In another embodiment, the epimerase comprises a catalytic site having a tertiary
structure that substantially conforms to the tertiary structure of the catalytic site of a
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates
having Brookhaven Protein Data Bank Accession Code 1bws. Such a catalytic site
preferably has a tertiary structure with an average root mean square deviation of less than
about 1 Å over at least about 25% of Cα positions of the tertiary structure of a catalytic
site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic
coordinates having Brookhaven Protein Data Bank Accession Code 1bws. The catalytic
site preferably includes the amino acid residues serine, tyrosine and lysine and, in one
embodiment, the tertiary structure positions of the amino acid residues serine, tyrosine and
lysine substantially conform to tertiary structure positions of residues Ser107, Tyr136 and
Lys140, respectively, as represented by atomic coordinates in Brookhaven Protein Data
Bank Accession Code 1bws.

In yet another embodiment of this method, the epimerase comprises an amino acid
sequence that aligns with SEQ ID NO:11 using a CLUSTAL alignment program, wherein
amino acid residues in the amino acid sequence align with 100% identity with at least
about 50%, and in another embodiment with at least about 75%, and in yet another
embodiment with at least about 90% of non-Xaa residues in SEQ ID NO:11. In another
embodiment, the epimerase comprises an amino acid sequence having at least 4
contiguous amino acid residues that are 100% identical to at least 4 contiguous amino acid
residues of an amino acid sequence selected from the group of SEQ ID NO:2, SEQ ID
NO:4, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID NO:10. In yet another embodiment,
the recombinant nucleic acid molecule comprises a nucleic acid sequence comprising at
least about 12 contiguous nucleotides having 100% identity with at least about 12
contiguous nucleotides of a nucleic acid sequence selected from the group of SEQ ID
NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 and SEQ ID NO:9.
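
The two sequence tests in this paragraph can be sketched in Python as follows. This is an illustrative sketch only: it assumes single-letter residue codes, with `X` standing for Xaa and `-` for an alignment gap, and it assumes the query has already been aligned to the consensus by an external program (the alignment itself is not reproduced here). The function names are hypothetical.

```python
def consensus_identity(aligned_query, aligned_consensus):
    """Fraction of non-Xaa consensus residues that the query matches identically.

    Both strings must be rows of the same alignment (equal length).
    'X' marks an Xaa position and '-' marks a gap; both are skipped
    when they occur in the consensus row.
    """
    if len(aligned_query) != len(aligned_consensus):
        raise ValueError("aligned sequences must have equal length")
    defined = matched = 0
    for q, c in zip(aligned_query.upper(), aligned_consensus.upper()):
        if c in ("X", "-"):
            continue
        defined += 1
        if q == c:
            matched += 1
    return matched / defined if defined else 0.0

def shares_contiguous_run(seq_a, seq_b, k=4):
    """True if some run of k consecutive residues of seq_a occurs verbatim in seq_b."""
    kmers = {seq_a[i:i + k] for i in range(len(seq_a) - k + 1)}
    return any(seq_b[i:i + k] in kmers for i in range(len(seq_b) - k + 1))
```

A query would satisfy the broadest alignment embodiment when `consensus_identity(...)` is at least about 0.5. The same `shares_contiguous_run` helper applies to the nucleotide test by passing `k=12`.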

In yet another embodiment of this method of the present invention, the epimerase
comprises an amino acid sequence having a motif: Gly-Xaa-Xaa-Gly-Xaa-Xaa-Gly. In
yet another embodiment, the recombinant nucleic acid molecule comprises a nucleic acid
sequence that is at least about 15% identical, and in another embodiment, at least about
20% identical, and in another embodiment, at least about 25% identical, to a nucleic acid
sequence selected from the group of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ
ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method with
Lipman-Pearson standard default parameters.
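
As an illustration, the Gly-Xaa-Xaa-Gly-Xaa-Xaa-Gly motif can be located in a protein sequence with a regular expression. The sketch below assumes single-letter residue codes (so the motif reads G-x-x-G-x-x-G) and treats Xaa as any uppercase letter; `find_motif` is a hypothetical helper name.

```python
import re

# A lookahead is used so that overlapping occurrences are also reported.
MOTIF = re.compile(r"(?=(G[A-Z]{2}G[A-Z]{2}G))")

def find_motif(protein_seq):
    """Return the 0-based start position of every G-x-x-G-x-x-G occurrence."""
    return [m.start() for m in MOTIF.finditer(protein_seq.upper())]
```

For example, `find_motif("MKILVTGGAGFIGS")` reports a single occurrence starting at position 6 (`GGAGFIG`).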

In yet another embodiment of this method of the present invention, the
recombinant nucleic acid molecule comprises a nucleic acid sequence that hybridizes under
stringent hybridization conditions to a nucleic acid sequence encoding a
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase. The nucleic acid sequence encoding the
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase includes nucleic acid sequences selected
from the group of SEQ ID NO:1, SEQ ID NO:3 and SEQ ID NO:5, and the
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase can include an amino acid sequence selected
from the group of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6.

In one embodiment of the method of the present invention, the microorganism is
selected from the group of bacteria, fungi and microalgae. In one embodiment, the
microorganism is acid-tolerant. Preferred bacteria include, but are not limited to,
Azotobacter and Pseudomonas. Preferred fungi include, but are not limited to, yeast,
including, but not limited to, Saccharomyces yeast. Preferred microalgae include, but are
not limited to, microalgae of the genera Prototheca and Chlorella, with microalgae of the
genus Prototheca being particularly preferred.

In yet another embodiment of the method of the present invention, the
microorganism is acid-tolerant and the step of culturing is conducted at a pH of less than
about 6.0, and more preferably, at a pH of less than about 5.5, and even more preferably,
at a pH of less than about 5.0. The step of culturing can be conducted in a fermentation
medium that comprises a carbon source other than D-mannose in one embodiment, and
in another embodiment, the step of culturing is conducted in a fermentation medium that
comprises glucose as a carbon source.

In yet another embodiment of the present method, the step of culturing is
conducted in a fermentation medium that is magnesium (Mg) limited. Preferably, the step
of culturing is conducted in a fermentation medium that is Mg limited during a cell growth
phase. In one embodiment, the fermentation medium includes less than about 0.5 g/L of
Mg during a cell growth phase, and more preferably, less than about 0.2 g/L of Mg during
a cell growth phase, and even more preferably, less than about 0.1 g/L of Mg during a cell
growth phase.
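
The nested "less than about" limits stated for culture pH and for magnesium during the growth phase can be captured in a small lookup, sketched here in Python. The tier numbering and the names `preference_tier`, `PH_LIMITS` and `MG_LIMITS_G_PER_L` are illustrative assumptions, and "about" is modelled as a hard limit for simplicity.

```python
# Successively stricter limits taken from the text above.
PH_LIMITS = (6.0, 5.5, 5.0)           # culture pH
MG_LIMITS_G_PER_L = (0.5, 0.2, 0.1)   # Mg during the cell growth phase, g/L

def preference_tier(value, limits):
    """Count how many of the successively stricter limits the value falls below.

    0 = outside all stated ranges, 1 = broadest range,
    2 = 'more preferably', 3 = 'even more preferably'.
    """
    return sum(1 for limit in limits if value < limit)
```

For instance, a culture held at pH 5.2 falls in tier 2 for pH, and a medium containing 0.05 g/L Mg during growth falls in tier 3 for magnesium.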

Another embodiment of the present invention relates to a microorganism for
producing ascorbic acid or esters thereof. The microorganism has a genetic modification
to increase the action of an enzyme selected from the group of hexokinase, glucose
phosphate isomerase, phosphomannose isomerase, phosphomannomutase,
GDP-D-mannose pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase,
GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase,
and/or L-galactono-γ-lactone dehydrogenase. Preferably, the genetic modification is a
genetic modification to increase the action of an enzyme selected from the group of
GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase,
L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase,
and even more preferably, to increase the action of GDP-D-mannose:GDP-L-galactose
epimerase.

In one embodiment, the microorganism has been genetically modified to express
a recombinant nucleic acid molecule encoding an epimerase that catalyzes conversion of
GDP-D-mannose to GDP-L-galactose, wherein the epimerase has a tertiary structure
having an average root mean square deviation of less than about 2.5 Å over at least about
25% of Cα positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data
Bank Accession Code 1bws. In another embodiment, the microorganism has been
genetically modified to express a recombinant nucleic acid molecule encoding an
epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein the
epimerase comprises an amino acid sequence that aligns with SEQ ID NO:11 using a
CLUSTAL alignment program, wherein amino acid residues in the amino acid sequence
align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO:11.
Preferred microorganisms are as disclosed for the method discussed above.

Yet another embodiment of the present invention relates to a plant for producing
ascorbic acid or esters thereof. Such a plant has a genetic modification to increase the
action of an enzyme selected from the group of hexokinase, glucose phosphate isomerase,
phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase,
GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase,
L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone
dehydrogenase. In a preferred embodiment, the genetic modification is a genetic
modification to increase the action of an enzyme selected from the group of
GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase,
L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase,
and in a more preferred embodiment, the genetic modification is a genetic modification
to increase the action of GDP-D-mannose:GDP-L-galactose epimerase.

In one embodiment, the plant further comprises a genetic modification to decrease
the action of an enzyme having GDP-D-mannose as a substrate other than
GDP-D-mannose:GDP-L-galactose epimerase. Such a genetic modification includes a genetic
modification to decrease the action of GDP-D-mannose-dehydrogenase. Such a plant also
includes a plant that has been genetically modified to express a recombinant nucleic acid
molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to
GDP-L-galactose, wherein the epimerase has a tertiary structure having an average root mean
square deviation of less than about 2.5 Å over at least about 25% of Cα positions of the
tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented
by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. In
another embodiment, such a plant has been genetically modified to express a recombinant
nucleic acid molecule encoding an epimerase that catalyzes conversion of
GDP-D-mannose to GDP-L-galactose, wherein the epimerase comprises an amino acid sequence
that aligns with SEQ ID NO:11 using a CLUSTAL alignment program, wherein amino
acid residues in the amino acid sequence align with 100% identity with at least about 50%
of non-Xaa residues in SEQ ID NO:11.

In one embodiment, a plant for producing ascorbic acid or esters thereof according
to the present invention is a microalga. Preferred microalgae include, but are not limited
to, microalgae of the genera Prototheca and Chlorella, with microalgae of the genus
Prototheca being particularly preferred. In another embodiment, the plant is a higher
plant, with consumable higher plants being more preferred.

BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1A is a schematic drawing of the pathway from glucose to GDP-D-mannose
in plants.

Fig. 1B is a schematic drawing of the pathway from GDP-D-mannose to
L-galactose-1-phosphate in plants.

Fig. 1C is a schematic drawing of the pathway from L-galactose to L-ascorbic acid 
in plants. 

Fig. 2A is a schematic drawing of selected carbon flow from glucose in 
Prototheca. 

Fig. 2B is a schematic drawing of selected carbon flow from glucose in
Prototheca.

Fig. 3 is a schematic drawing that shows the lineage of mutants derived from 
Prototheca moriformis ATCC 75669, and their ability to produce L-ascorbic acid. 

Fig. 4 is a bar graph illustrating the conversion of substrates by resting cells of 
strain NA45-3 following growth in media containing various magnesium concentrations
and resuspension in media containing various magnesium concentrations. 

Fig. 5 is a line graph showing the relationship between specific ascorbic acid 
formation in cultures of Prototheca strains and the specific activity of GDP-D- 
mannose:GDP-L-galactose epimerase in extracts prepared from cells harvested from the 
same cultures.

Fig. 6 is a line graph showing the relationship between specific epimerase activity 
and the degree of magnesium limitation in two strains, ATCC 75669 and EMS 13-4. 

Fig. 7 depicts the overall catalytic mechanism of GDP-D-mannose:GDP-L-galactose
epimerase proposed by Barber (1979, J. Biol. Chem. 254:7600-7603).

Fig. 8A depicts the catalytic mechanism of GDP-D-mannose-4,6-dehydratase
(converts GDP-D-mannose to GDP-4-keto-6-deoxy-D-mannose). 

Fig. 8B depicts the catalytic mechanism of GDP-4-keto-6-deoxy-D-mannose 
epimerase/reductase (converts GDP-4-keto-6-deoxy-D-mannose to GDP-L-fucose) 
(Chang, et al., 1988, J. Biol. Chem. 263:1693-1697; Barber, 1980, Plant Physiol.
66:326-329).

DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates to a biosynthetic method and production
microorganisms and plants for producing vitamin C (ascorbic acid, L-ascorbic acid, or
AA). Such a method includes fermentation of a genetically modified microorganism to
produce L-ascorbic acid. In particular, the present invention relates to the use of
nucleotide sequences encoding epimerases, including the endogenous
GDP-D-mannose:GDP-L-galactose epimerase from the L-ascorbic acid pathway, as well as
epimerases having structural homology (e.g., by nucleotide/amino acid sequence and/or
tertiary structure of the encoded protein) to GDP-4-keto-6-deoxy-D-mannose
epimerase/reductases, or UDP-galactose 4-epimerases, for the purposes of improving the
biosynthetic production of ascorbic acid. The present invention also relates to genetically
modified microorganisms, such as strains of microalgae, bacteria and yeast useful for
producing L-ascorbic acid, and to genetically modified plants, useful for producing
consumable plant food products.

One embodiment of the present invention relates to a method to produce
L-ascorbic acid by fermentation of a genetically modified microorganism. This method
includes the steps of (a) culturing in a fermentation medium a microorganism having a
genetic modification to increase the action of an enzyme selected from the group of
hexokinase, glucose phosphate isomerase, phosphomannose isomerase,
phosphomannomutase, GDP-mannose pyrophosphorylase, GDP-D-mannose:GDP-L-galactose
epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase,
L-galactose dehydrogenase, and L-galactono-γ-lactone dehydrogenase; and (b) recovering
L-ascorbic acid or esters thereof. The various enzymes in this list represent the enzymes
involved in the vitamin C biosynthetic pathway in plants. It is uncertain at this time
whether the enzyme represented by GDP-L-galactose phosphorylase is actually a
phosphorylase or a pyrophosphorylase (i.e., GDP-L-galactose pyrophosphorylase).
Therefore, use of the term "GDP-L-galactose phosphorylase" herein refers to either
GDP-L-galactose phosphorylase or GDP-L-galactose pyrophosphorylase. In one aspect of the
invention, this method includes the step of culturing in a fermentation medium a
microorganism having a genetic modification to increase the action of an epimerase that
catalyzes conversion of GDP-D-mannose to GDP-L-galactose. This aspect of the present
invention is discussed in detail below.

Another embodiment of the present invention relates to a genetically modified
microorganism for producing L-ascorbic acid or esters thereof. Another embodiment of
the present invention relates to a genetically modified plant for producing L-ascorbic acid
or esters thereof. Both genetically modified microorganisms (e.g., bacteria, yeast,
microalgae) and plants (e.g., higher plants, microalgae) have a genetic modification to
increase the action of an enzyme selected from the group of hexokinase, glucose
phosphate isomerase, phosphomannose isomerase, phosphomannomutase, GDP-mannose
pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose
phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or
L-galactono-γ-lactone dehydrogenase. In a preferred embodiment, both genetically
modified microorganisms (e.g., bacteria, yeast, microalgae) and plants (e.g., higher plants,
microalgae) have a genetic modification to increase the action of an epimerase that
catalyzes conversion of GDP-D-mannose to GDP-L-galactose. In one embodiment, the
genetic modification includes the transformation of the microorganism or plant with the
epimerase as described above.

To produce significantly high yields of L-ascorbic acid by the method of the
present invention, a plant and/or microorganism is genetically modified to enhance
production of L-ascorbic acid. As used herein, a genetically modified plant (such as a
higher plant or microalga) or microorganism, such as a microalga (Prototheca,
Chlorella), Escherichia coli, or a yeast, is modified (i.e., mutated or changed) within its
genome and/or by recombinant technology (i.e., genetic engineering) from its normal (i.e.,
wild-type or naturally occurring) form. In a preferred embodiment, a genetically modified
plant or microorganism according to the present invention has been modified by
recombinant technology. Genetic modification of a plant or microorganism can be
accomplished using classical strain development and/or molecular genetic techniques,
including genetic engineering techniques. Such techniques are generally disclosed herein
and are additionally disclosed, for example, in Sambrook et al., 1989, Molecular Cloning:
A Laboratory Manual, Cold Spring Harbor Labs Press; Roessler, 1995, Plant Lipid
Metabolism, pp. 46-48; and Roessler et al., 1994, in Bioconversion for Fuels, Himmel et
al. eds., American Chemical Society, Washington D.C., pp. 255-70. These references are
incorporated by reference herein in their entirety.

In some embodiments, a genetically modified plant or microorganism can include
a natural genetic variant as well as a plant or microorganism in which nucleic acid
molecules have been inserted, deleted or modified, including by mutation of endogenous
genes (e.g., by insertion, deletion, substitution, and/or inversion of nucleotides), in such
a manner that the modifications provide the desired effect within the plant or
microorganism. As discussed above, a genetically modified plant or microorganism
includes a plant or microorganism that has been modified using recombinant technology.

As used herein, genetic modifications which result in a decrease in gene
expression, an increase in inhibition of gene expression or inhibition of a gene product
(i.e., the protein encoded by the gene), a decrease in the function of the gene, or a
decrease in the function of the gene product can be referred to as inactivation (complete
or partial), deletion, interruption, blockage, down-regulation, or decreased action of a
gene. For example, a genetic modification in a gene which results in a decrease in the
function of the protein encoded by such gene can be the result of a complete deletion of
the gene encoding the protein (i.e., the gene does not exist, and therefore the protein does
not exist), a mutation in the gene encoding the protein which results in incomplete or no
translation of the protein (e.g., the protein is not expressed), or a mutation in the gene
which decreases or abolishes the natural function of the protein (e.g., a protein is
expressed which has decreased or no enzymatic activity).

Genetic modifications which result in an increase in gene expression or function
can be referred to as amplification, overproduction, overexpression, activation,
enhancement, addition, up-regulation or increased action of a gene. Additionally, a genetic
modification to a gene which modifies the expression, function, or activity of the gene can
have an impact on the action of other genes and their expression products within a given
metabolic pathway (e.g., by inhibition or competition). In this embodiment, the action
(e.g., activity) of a particular gene and/or its product can be affected (i.e., upregulated or
downregulated) by a genetic modification to another gene within the same metabolic
pathway, or to a gene within a different metabolic pathway which impacts the pathway of
interest by competition, inhibition, substrate formation, etc.

In general, a plant or microorganism having a genetic modification that affects
L-ascorbic acid production has at least one genetic modification, as discussed above, which
results in a change in the L-ascorbic acid production pathway as compared to a wild-type
plant or microorganism grown or cultured under the same conditions. Such a
modification in an L-ascorbic acid production pathway changes the ability of the plant or
microorganism to produce L-ascorbic acid. According to the present invention, a
genetically modified plant or microorganism preferably has an enhanced ability to produce
L-ascorbic acid compared to a wild-type plant or microorganism cultured under the same
conditions.

The present invention is based on the present inventors' discovery of the
biosynthetic pathway for L-ascorbic acid (vitamin C) in plants and microorganisms. Prior
to the present invention, the metabolic pathway by which plants produce L-ascorbic acid
was not completely elucidated. The present inventors have demonstrated that L-ascorbic
acid production in plants, including L-ascorbic acid-producing microorganisms (e.g.,
microalgae), is a pathway which uses GDP-D-mannose and involves sugar phosphates and
NDP-sugars. In addition, the present inventors have made the surprising discovery that
both L-galactose and L-galactono-γ-lactone can be rapidly converted into L-ascorbic acid
in L-ascorbic acid-producing microalgae, including Prototheca and Chlorella
pyrenoidosa. The entire pathway for L-ascorbic acid production in plants is set forth in
Figs. 1A-1C. More particularly, Fig. 1A shows that the production of L-ascorbic acid in
plants proceeds through the production of mannose intermediates to GDP-D-mannose,
followed by the conversion of GDP-D-mannose to GDP-L-galactose by
GDP-D-mannose:GDP-L-galactose epimerase (also known as GDP-D-mannose-3,5-epimerase)
(Fig. 1B), and then by the subsequent progression to L-galactose-1-P, L-galactose,
L-galactonic acid (optional), L-galactono-γ-lactone, and L-ascorbic acid (Fig. 1C). Fig. 1B
also illustrates alternate pathways for the use of various intermediates, such as
GDP-D-mannose. Certain aspects of this pathway have been independently described in a
publication (Wheeler, et al., 1998, Nature 393:365-369), incorporated herein by reference
in its entirety.
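
The linear order of the pathway described above can be written down as a simple data structure, sketched here in Python. The enzyme order and the intermediates from GDP-D-mannose onward follow the text; the phosphorylated hexose intermediates given for the first four steps are standard biochemistry and are an assumption insofar as the text refers to them only as "mannose intermediates". The optional L-galactonic acid branch is omitted, and all names (`PATHWAY`, `is_linear_chain`, `enzymes_from`) are hypothetical.

```python
# (enzyme, substrate, product) for each step, in pathway order.
PATHWAY = [
    ("hexokinase", "D-glucose", "D-glucose-6-P"),
    ("glucose phosphate isomerase", "D-glucose-6-P", "D-fructose-6-P"),
    ("phosphomannose isomerase", "D-fructose-6-P", "D-mannose-6-P"),
    ("phosphomannomutase", "D-mannose-6-P", "D-mannose-1-P"),
    ("GDP-D-mannose pyrophosphorylase", "D-mannose-1-P", "GDP-D-mannose"),
    ("GDP-D-mannose:GDP-L-galactose epimerase", "GDP-D-mannose", "GDP-L-galactose"),
    ("GDP-L-galactose phosphorylase", "GDP-L-galactose", "L-galactose-1-P"),
    ("L-galactose-1-P-phosphatase", "L-galactose-1-P", "L-galactose"),
    ("L-galactose dehydrogenase", "L-galactose", "L-galactono-gamma-lactone"),
    ("L-galactono-gamma-lactone dehydrogenase", "L-galactono-gamma-lactone", "L-ascorbic acid"),
]

def is_linear_chain(steps):
    """True if each step's product is the next step's substrate."""
    return all(steps[i][2] == steps[i + 1][1] for i in range(len(steps) - 1))

def enzymes_from(metabolite, steps=PATHWAY):
    """Enzymes acting at or after the step that consumes the given metabolite."""
    for i, (_, substrate, _) in enumerate(steps):
        if substrate == metabolite:
            return [enzyme for enzyme, _, _ in steps[i:]]
    raise KeyError(metabolite)
```

Calling `enzymes_from("GDP-D-mannose")` returns the five enzymes from the epimerase onward, which is the same subset the specification singles out as preferred targets for increased action.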

Points within the L-ascorbic acid production pathway which can be targeted by
genetic modification to affect the production of L-ascorbic acid can generally be
categorized into at least one of the following pathways: (a) pathways affecting the
production of GDP-D-mannose (e.g., pathways for converting a carbon source into
GDP-D-mannose); (b) pathways for converting GDP-D-mannose into other compounds; (c)
pathways associated with or downstream of the action of GDP-D-mannose:GDP-L-galactose
epimerase; (d) pathways which compete for substrates involved in the
production of any of the intermediates within the L-ascorbic acid production pathway, and
in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-1-phosphate,
L-galactose, L-galactono-γ-lactone, and/or L-ascorbic acid; and (e) pathways which inhibit
production of any of the intermediates within the L-ascorbic acid production pathway, and
in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-1-phosphate,
L-galactose, L-galactono-γ-lactone, and/or L-ascorbic acid.

A genetically modified plant or microorganism useful in a method of the present
invention typically has at least one genetic modification in the L-ascorbic acid production
pathway which results in an enhanced production of L-ascorbic acid. In one embodiment,
a genetically modified plant or microorganism has at least one genetic modification that
results in: (a) an enhanced production of GDP-D-mannose; (b) an inhibition of pathways
which convert GDP-D-mannose into compounds other than GDP-L-galactose; (c) an
enhancement of action of the GDP-D-mannose:GDP-L-galactose epimerase; (d) an
enhancement of the action of enzymes downstream of the GDP-D-mannose:GDP-L-galactose
epimerase; (e) an inhibition of pathways which compete for substrates involved
in the production of any of the intermediates within the L-ascorbic acid production
pathway, and in particular, with GDP-D-mannose, GDP-L-galactose,
L-galactose-1-phosphate, L-galactose, L-galactono-γ-lactone, and/or L-ascorbic acid; and (f) an
inhibition of pathways which inhibit production of any of the intermediates within the
L-ascorbic acid production pathway, and in particular, with GDP-D-mannose,
GDP-L-galactose, L-galactose-1-phosphate, L-galactose, L-galactono-γ-lactone, and/or
L-ascorbic acid.

An enhanced production of GDP-D-mannose by genetic modification of the plant
or microorganism can be achieved by, for example, overexpression of enzymes such as
hexokinase, glucose phosphate isomerase, phosphomannose isomerase (PMI),
phosphomannomutase (PMM) and/or GDP-D-mannose pyrophosphorylase (GMP).
Inhibition of pathways which convert GDP-D-mannose to compounds other than
GDP-L-galactose can be achieved, for example, by modifications which inhibit polysaccharide
synthesis, GDP-D-rhamnose synthesis, GDP-L-fucose synthesis and/or
GDP-D-mannuronic acid synthesis. An increase in the action of the GDP-D-mannose:GDP-L-galactose
epimerase and of enzymes downstream of the epimerase in the L-ascorbic acid
production pathway can be achieved by genetic modifications which include, but are not
limited to: overexpression of the epimerase gene (i.e., by overexpression of a recombinant
nucleic acid molecule encoding the epimerase gene or a homologue thereof (discussed in
detail below), and/or by mutation of the endogenous or recombinant gene to enhance
expression of the gene) and/or overexpression of genes downstream of the epimerase
which encode subsequent enzymes in the L-ascorbic acid pathway. Finally, metabolic
pathways which compete with or inhibit the L-ascorbic acid production pathway can be
inhibited by deleting or mutating enzymes, substrates or products which either inhibit or
compete for an enzyme, substrate or product in the L-ascorbic acid pathway.

As discussed above, a genetically modified plant or microorganism useful in the 
method of the present invention can have at least one genetic modification (e.g., mutation 
in the endogenous gene or addition of a recombinant gene) in a gene encoding an enzyme 
involved in the L-ascorbic acid production pathway. Such genetic modifications
preferably increase (i.e., enhance) the action of such enzymes such that L-ascorbic acid
is preferentially produced as compared to other possible end products in related metabolic
pathways. Such genetic modifications include, but are not limited to, overexpression of
the gene encoding such enzyme, and deletion, mutation, or downregulation of genes
encoding competitors or inhibitors of such enzyme. Preferred enzymes for which the
action of the gene encoding such enzyme can be genetically modified include: hexokinase,
glucose phosphate isomerase, phosphomannose isomerase (PMI), phosphomannomutase
(PMM), GDP-D-mannose pyrophosphorylase (GMP), GDP-D-mannose:GDP-L-galactose
epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose
dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase. More preferably, a
genetically modified plant or microorganism useful in the present invention has a genetic
modification which increases the action of an enzyme selected from the group of
GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase,
L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase.
Even more preferably, a genetically modified plant or microorganism useful in the present
invention has a genetic modification which increases the action of
GDP-D-mannose:GDP-L-galactose epimerase. These enzymes and the reactions catalyzed by such enzymes are
illustrated in Figs. 1A-1C.
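
By way of illustration only, the three tiers of preferred enzymes listed above can be
sketched as a small data structure. The enzyme names and the intermediates from
GDP-D-mannose onward are taken from the text; the intermediates upstream of GDP-D-mannose
are standard biochemistry and are an assumption here, and the names PATHWAY,
overexpression_targets and is_linear are hypothetical.

```python
# Illustrative model only: one (enzyme, substrate, product) tuple per pathway step.
# Steps before GDP-D-mannose use textbook intermediates (an assumption, not quoted
# from this passage); steps from GDP-D-mannose onward follow the text.
PATHWAY = [
    ("hexokinase", "glucose", "glucose-6-phosphate"),
    ("glucose phosphate isomerase", "glucose-6-phosphate", "fructose-6-phosphate"),
    ("phosphomannose isomerase (PMI)", "fructose-6-phosphate", "mannose-6-phosphate"),
    ("phosphomannomutase (PMM)", "mannose-6-phosphate", "mannose-1-phosphate"),
    ("GDP-D-mannose pyrophosphorylase (GMP)", "mannose-1-phosphate", "GDP-D-mannose"),
    ("GDP-D-mannose:GDP-L-galactose epimerase", "GDP-D-mannose", "GDP-L-galactose"),
    ("GDP-L-galactose phosphorylase", "GDP-L-galactose", "L-galactose-1-phosphate"),
    ("L-galactose-1-P-phosphatase", "L-galactose-1-phosphate", "L-galactose"),
    ("L-galactose dehydrogenase", "L-galactose", "L-galactono-gamma-lactone"),
    ("L-galactono-gamma-lactone dehydrogenase", "L-galactono-gamma-lactone",
     "L-ascorbic acid"),
]

EPIMERASE = "GDP-D-mannose:GDP-L-galactose epimerase"


def is_linear(pathway=PATHWAY):
    """Check that each step's product is the next step's substrate."""
    return all(a[2] == b[1] for a, b in zip(pathway, pathway[1:]))


def overexpression_targets(tier):
    """Return the enzymes named in each preference tier of the paragraph above."""
    names = [enzyme for enzyme, _, _ in PATHWAY]
    idx = names.index(EPIMERASE)
    if tier == "preferred":        # all ten listed enzymes
        return names
    if tier == "more_preferred":   # the epimerase and the enzymes downstream of it
        return names[idx:]
    if tier == "most_preferred":   # the epimerase alone
        return [names[idx]]
    raise ValueError(f"unknown tier: {tier}")
```

Calling overexpression_targets("more_preferred") returns the epimerase followed by the four
downstream enzymes, which mirrors the narrowing of the preferred group in the text.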

Prior to the present invention, without knowing the L-ascorbic acid biosynthetic
(i.e., production) pathway, previous mutagenesis and screening efforts were limited in that
only non-lethal mutations could be detected. One embodiment of the present invention
relates to elimination of a key competing enzyme that diverts carbon flow from L-ascorbic
acid synthesis. If such enzyme is absolutely required for growth on glucose, then mutants
lacking the enzyme (and, therefore, having increased carbon flow to L-ascorbic acid)
would have been nonviable and would not have been detected during prior screening efforts.
One such enzyme is phosphofructokinase (PFK) (see Fig. 2A). PFK is required for
growth on glucose, and is the major step drawing carbon away from L-ascorbic acid
biosynthesis (Fig. 2A). Elimination of PFK would render the cells nonviable on glucose-based
media. Selection of a conditional mutant in which PFK is inactivated by a temperature
shift, for example, may allow development of an L-ascorbic acid process where cell growth
is achieved under permissive fermentation conditions, and L-ascorbic acid production
(from glucose) is initiated by a shift to non-permissive conditions. In this example, the
temperature shift would eliminate carbon flow from glucose to glycolysis via PFK, thereby
shunting carbon into the L-ascorbic acid branch of metabolism. This approach has
application not only in natural L-ascorbic acid producing organisms, but also in L-ascorbic
acid recombinant systems (genetically engineered plants or microorganisms) as discussed
herein.

Knowing the identity and mechanism of the rate-limiting pathway enzymes in the
L-ascorbic acid production pathway allows for design of specific inhibitors of the enzymes
that are also growth inhibitory. Selection of mutants resistant to the inhibitors allows for
the isolation of strains that contain L-ascorbic acid pathway enzymes with more favorable
kinetic properties. Therefore, one embodiment of the present invention is to identify
inhibitors of the enzymes that are also growth inhibitory. These inhibitors are then used
to select genetic mutants that overcome this inhibition and produce L-ascorbic acid at high
levels. In this embodiment, the resultant plant or microorganism is a non-recombinant
strain which can then be further modified by recombinant technology, if desired. In
recombinant L-ascorbic acid producing strains, random mutagenesis and screening can be
used as a final step to increase L-ascorbic acid production.

In yet another embodiment genetic modifications are made to an L-ascorbic acid 
producing organism directly. This allows one to build upon a base of data acquired during 
prior classical strain improvement efforts, and perhaps more importantly, allows one to
take advantage of undefined beneficial mutations that occurred during classical strain
improvement. Furthermore, fewer problems are encountered when expressing native,
rather than heterologous, genes. The most advanced genetic system for microalgae
has been developed for Chlamydomonas reinhardtii. Preferably,
development of such a genetically modified production organism would include: isolation
of mutant(s) with a specific nutritional requirement for use with a cloned selectable marker
gene (similar to the ura3 mutants used in yeast and fungal systems); a cloned selectable
marker such as URA3 or, alternatively, identification and cloning of a gene that specifies
resistance to a toxic compound (this would be analogous to the use of antibiotic resistance
genes in bacterial systems, and, as is the case in yeast and other fungi, a means of
inserting/removing the marker gene repeatedly would be required, unless several different
selectable markers were developed); a transformation system for introducing DNA into
the production organism and achieving stable transformation and expression; and a
promoter system (preferably several) for high-level expression of cloned genes in the
organism.

Another embodiment of the present invention, discussed in detail below, is to place
key genes or allelic variants and homologues thereof from L-ascorbic acid producing
organisms (i.e., higher plants and microalgae) into a plant or microorganism that is more
amenable to molecular genetic manipulation, including endogenous L-ascorbic acid
producing microorganisms and suitable plants. For example, it is possible to identify a
suitable non-pathogenic organism based on the requirement of growth (on glucose) at low
pH (i.e., acid-tolerant organisms, discussed in detail below).

One suitable candidate for recombinant production in any suitable host organism 
is the gene (nucleic acid molecule) encoding GDP-D-mannose:GDP-L-galactose
epimerase and homologues of the GDP-D-mannose:GDP-L-galactose epimerase, as well
as any other epimerase that has structural homology at the primary (i.e., sequence) or
tertiary (i.e., three dimensional) level, to a GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase, or to a UDP-galactose 4-epimerase. Many microorganisms produce
GDP-D-mannose as a precursor to exopolysaccharide and glycoprotein production, even though
such organisms may not make L-ascorbic acid. This aspect of the present invention is 
discussed in detail below. 

Referring to Figs. 1A-1C, at least some of the enzymes from glucose-6-phosphate
to GDP-D-mannose are present in many organisms. In fact, the entire sequence is present
in bacteria such as Azotobacter vinelandii and Pseudomonas aeruginosa, and makes up the
early steps in the biosynthesis of the exopolysaccharide alginate. In this regard, it is
possible that the only thing preventing these organisms from producing L-ascorbic acid
is the lack of GDP-D-mannose:GDP-L-galactose epimerase. The presence of PMI,
PMM and GMP (see Fig. 1A) in so many organisms is important for two reasons. First,
these organisms themselves could serve as alternate hosts for L-ascorbic acid production,
by building on the existing early pathway enzymes and adding the required cloned genes
(the epimerase and possibly others). Second, the genes encoding PMI, PMM and GMP
can be cloned into a new organism where, together with the cloned epimerase, they would
encode the overall pathway from glucose-6-phosphate to GDP-L-galactose.

In order to screen genomic DNA or cDNA libraries from different organisms and 
to isolate nucleic acid molecules encoding enzymes such as the
GDP-D-mannose:GDP-L-galactose epimerase, one can use any of a variety of standard molecular
and biochemical techniques. For example, the GDP-D-mannose:GDP-L-galactose
epimerase can be purified from an organism such as Prototheca, the N-terminal amino
acid sequence can be determined (including, if necessary, the sequence of internal peptide
fragments), and this information can be used to design degenerate primers for amplifying
a gene fragment from the organism's DNA. This fragment would then be used to probe
the library, and subsequently fragments that hybridize to the probe would be cloned in that
organism or another suitable production organism. There is ample precedent for plant
enzymes being expressed in an active form in bacteria, such as E. coli. Alternatively,
yeast are also a suitable candidate for developing a heterologous system for L-ascorbic
acid production.
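
By way of illustration only, the degenerate primer design step mentioned above can be
sketched as follows. The sketch back-translates a peptide into IUPAC degenerate codons
using the standard genetic code and picks the peptide window whose primer pool is
smallest. The example peptide is hypothetical and is not a sequence from any epimerase;
the function names are likewise assumptions, and practical primer design would also weigh
codon usage, melting temperature and primer length.

```python
# Number of bases represented by each IUPAC nucleotide symbol.
IUPAC_COUNT = {
    "A": 1, "C": 1, "G": 1, "T": 1,
    "R": 2, "Y": 2, "M": 2, "K": 2, "S": 2, "W": 2,
    "H": 3, "B": 3, "D": 3, "V": 3, "N": 4,
}

# One IUPAC degenerate codon per amino acid (standard genetic code).
# For Leu, Arg and Ser the single code over-covers the six real codons,
# so the reported pool size is an upper bound for those residues.
DEGENERATE_CODON = {
    "A": "GCN", "R": "MGN", "N": "AAY", "D": "GAY", "C": "TGY",
    "Q": "CAR", "E": "GAR", "G": "GGN", "H": "CAY", "I": "ATH",
    "L": "YTN", "K": "AAR", "M": "ATG", "F": "TTY", "P": "CCN",
    "S": "WSN", "T": "ACN", "W": "TGG", "Y": "TAY", "V": "GTN",
}


def back_translate(peptide):
    """Return the degenerate DNA sequence that could encode the peptide."""
    return "".join(DEGENERATE_CODON[aa] for aa in peptide.upper())


def degeneracy(dna):
    """Number of distinct oligonucleotides in the degenerate primer pool."""
    total = 1
    for symbol in dna:
        total *= IUPAC_COUNT[symbol]
    return total


def best_window(peptide, n_residues):
    """Return (start, window, degeneracy) for the least degenerate window."""
    candidates = []
    for start in range(len(peptide) - n_residues + 1):
        window = peptide[start:start + n_residues]
        candidates.append((start, window, degeneracy(back_translate(window))))
    # min() keeps the first window when several share the lowest degeneracy.
    return min(candidates, key=lambda c: c[2])


# Hypothetical N-terminal peptide, used only to exercise the functions.
EXAMPLE_PEPTIDE = "LSRMWKDHELS"
```

Windows rich in Met and Trp (one codon each) give the smallest pools, which is why
such regions are commonly favored when choosing where to place a degenerate primer.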

It is to be understood that the present invention discloses a method comprising the
use of a microorganism with an ability to produce commercially useful amounts of L-ascorbic
acid in a fermentation process (i.e., preferably an enhanced ability to produce L-ascorbic
acid compared to a wild-type microorganism cultured under the same
conditions). This method is achieved by the genetic modification of one or more genes
encoding a protein involved in an L-ascorbic acid pathway which results in the production
(expression) of a protein having an altered (e.g., increased or decreased) function as
compared to the corresponding wild-type protein. Preferably, such genetic modification
is achieved by recombinant technology. It will be appreciated by those of skill in the art
that production of genetically modified plants or microorganisms having a particular
altered function as described elsewhere herein (e.g., an enhanced ability to produce
GDP-D-mannose:GDP-L-galactose epimerase), such as by transformation of the plant or
microorganism with a nucleic acid molecule which encodes a particular enzyme, can
produce many organisms meeting the given functional requirement, albeit by virtue of a
variety of different genetic modifications. For example, different random nucleotide
deletions and/or substitutions in a given nucleic acid sequence may all give rise to the
same phenotypic result (e.g., decreased enzymatic activity of the protein encoded by the
sequence). The present invention contemplates any such genetic modification which
results in the production of a plant or microorganism having the characteristics set forth
herein.

A microorganism to be used in the fermentation method of the present invention 
is preferably a bacterium, a fungus, or a microalga which has been genetically modified
according to the disclosure above. More preferably, a microorganism useful in the present
invention is a microalga which is capable of producing L-ascorbic acid, although the
present invention includes microorganisms which are genetically engineered to produce
L-ascorbic acid using the knowledge of the key components of the pathway and the
guidance provided herein. Even more preferably, a microorganism useful in the present
invention is an acid-tolerant microorganism, such as microalgae of the genera Prototheca
and Chlorella. Acid-tolerant yeast and bacteria are also known in the art. Acid-tolerant
microorganisms are discussed in detail below. Particularly preferred microalgae include
microalgae of the genera Prototheca and Chlorella, with Prototheca being most
preferred. All known species of Prototheca produce L-ascorbic acid. Production of
ascorbic acid by microalgae of the genera Prototheca and Chlorella is described in detail
in U.S. Patent No. 5,792,631, issued August 11, 1998, and in U.S. Patent No. 5,900,370,
issued May 4, 1999, both of which are incorporated herein by reference in their entirety.
Preferred bacteria for use in the present invention include, but are not limited to,
Azotobacter, Pseudomonas, and Escherichia, although acid-tolerant bacteria are more
preferred. Preferred fungi for use in the present invention include yeast, and more
preferably, yeast of the genus Saccharomyces. A microorganism for use in the
fermentation method of the present invention can also be referred to as a production
organism. According to the present invention, microalgae can be referred to herein either
as microorganisms or as plants.

A preferred plant to genetically modify according to the present invention is
a plant suitable for consumption by animals, including humans. More
preferably, such a plant is a plant that naturally produces L-ascorbic acid, although other 
plants can be genetically modified to produce L-ascorbic acid using the guidance provided 
herein. 

The L-ascorbic acid production pathways of the microalgae Prototheca and
Chlorella pyrenoidosa, which are addressed as specific embodiments of the present invention,
are described below. It will be appreciated that other plants and, in particular, other
microorganisms, have similar L-ascorbic acid pathways and genes and proteins having
similar structure and function within such pathways. It will also be appreciated that plants
and microorganisms which do not naturally produce L-ascorbic acid can be modified
according to the present invention to produce L-ascorbic acid. As such, the principles
discussed below with regard to Prototheca and Chlorella pyrenoidosa are applicable to
other plants and microorganisms, including genetically modified plants and
microorganisms.

In one embodiment of the present invention, the action of an enzyme in the L-ascorbic
acid production pathway is increased by amplification of the expression (i.e.,
overexpression) of an enzyme in the pathway, and particularly, the
GDP-D-mannose:GDP-L-galactose epimerase, homologues of the epimerase, and/or enzymes
downstream of the epimerase. Overexpression of an enzyme can be accomplished, for
example, by introduction of a recombinant nucleic acid molecule encoding the enzyme.
It is preferred that the gene encoding an enzyme in the L-ascorbic acid production
pathway be cloned under control of an artificial promoter. The promoter can be any
suitable promoter that will provide a level of enzyme expression required to maintain a
sufficient level of L-ascorbic acid in the production organism. Preferred promoters are
constitutive (rather than inducible) promoters, since the need for addition of expensive
inducers is therefore obviated. The gene dosage (copy number) of a recombinant nucleic
acid molecule according to the present invention can be varied according to the
requirements for maximum product formation. In one embodiment, the recombinant
nucleic acid molecule encoding a gene in the L-ascorbic acid production pathway is
integrated into the chromosomes of the microorganism.

It is another embodiment of the present invention to provide a microorganism
having one or more enzymes in the L-ascorbic acid production pathway with improved
affinity for its substrates. An enzyme with improved affinity for its substrates can be
produced by any suitable method of genetic modification or protein engineering. For
example, computer-based protein engineering can be used to design an epimerase protein
with greater stability and better affinity for its substrate. See, for example, Maulik et al.,
1997, Molecular Biotechnology: Therapeutic Applications and Strategies, Wiley-Liss, 
Inc., which is incorporated herein by reference in its entirety. 

Recombinant nucleic acid molecules encoding proteins in the L-ascorbic acid 
production pathway can be modified to enhance or reduce the function (i.e., activity) of
the protein, as desired to increase L-ascorbic acid production, by any suitable method of
genetic modification. For example, a recombinant nucleic acid molecule encoding an
enzyme can be modified by any method for inserting, deleting, and/or substituting
nucleotides, such as by error-prone PCR. In this method, the gene is amplified under
conditions that lead to a high frequency of misincorporation errors by the DNA
polymerase used for the amplification. As a result, a high frequency of mutations is
obtained in the PCR products. The resulting gene mutants can then be screened for
enhanced substrate affinity, enhanced enzymatic activity, or reduced/increased inhibitory
ability by testing the mutant genes for the ability to confer increased L-ascorbic acid
production onto a test microorganism, as compared to a microorganism carrying the
non-mutated recombinant nucleic acid molecule.
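
By way of illustration only, the generation of a mutant library by misincorporation can
be sketched as a simple simulation. This is a simplified model under stated assumptions:
only base substitutions are modeled, every position shares one hypothetical per-base
error rate, and all substitutions are equally likely, whereas real error-prone PCR
produces biased mutational spectra. The function names and the template below are
hypothetical.

```python
import random

BASES = "ACGT"


def mutate(template, error_rate, rng):
    """Copy the template, substituting each base with probability error_rate."""
    copy = []
    for base in template:
        if rng.random() < error_rate:
            # Misincorporation: choose one of the three other bases.
            copy.append(rng.choice([b for b in BASES if b != base]))
        else:
            copy.append(base)
    return "".join(copy)


def mutant_library(template, error_rate, size, seed=0):
    """Return `size` independently mutated copies of the template."""
    rng = random.Random(seed)  # seeded so that a run can be reproduced
    return [mutate(template, error_rate, rng) for _ in range(size)]


def count_differences(a, b):
    """Number of positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


# Hypothetical template fragment, used only to exercise the functions.
TEMPLATE = "ATGGCTAAAGGTGAACTG"
```

Each library member would then be scored in a screen, for example by comparing
L-ascorbic acid production against a strain carrying the non-mutated molecule, as
described in the text; that screening step is outside the scope of this sketch.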

Another embodiment of the present invention includes a microorganism in which
competitive side reactions are blocked, including all reactions for which GDP-D-mannose
is a substrate other than the production of L-ascorbic acid. In a preferred embodiment,
a microorganism having complete or partial inactivation (decrease in the action) of
genes encoding enzymes which compete with the GDP-D-mannose:GDP-L-galactose
epimerase for the GDP-D-mannose substrate is provided. Such enzymes include
GDP-D-mannase and/or GDP-D-mannose-dehydrogenase. As used herein, inactivation of a gene
can refer to any modification of a gene which results in a decrease in the activity (i.e., 
expression or function) of such a gene, including attenuation of activity or complete 
deletion of activity. 

As discussed above, a particularly preferred aspect of the method to produce L-ascorbic
acid by fermentation of a genetically modified microorganism of the present
invention includes the step of culturing in a fermentation medium a microorganism having
a genetic modification to increase the action of an epimerase that catalyzes conversion of
GDP-D-mannose to GDP-L-galactose. According to the present invention, such an
epimerase can include the endogenous GDP-D-mannose:GDP-L-galactose epimerase from
the L-ascorbic acid pathway, described above, as well as any other epimerase that has
structural homology at the primary (i.e., sequence) or tertiary (i.e., three dimensional)
level, to a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase, or to a UDP-galactose
4-epimerase. Such structural homology is discussed in detail below. Preferably, such an
epimerase is capable of catalyzing the conversion of GDP-D-mannose to GDP-L-galactose.
In one embodiment, the genetic modification includes transformation of the
microorganism with a recombinant nucleic acid molecule that expresses such an
epimerase.

Therefore, the epimerase encompassed in the method and organisms of the 
present invention includes the endogenous epimerase which operates in the naturally 
occurring ascorbic acid biosynthetic pathway (referred to herein as
GDP-D-mannose:GDP-L-galactose epimerase), GDP-4-keto-6-deoxy-D-mannose epimerase/reductases,
and any other epimerase which is capable of catalyzing the conversion of
GDP-D-mannose to GDP-L-galactose and which is structurally homologous to a
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4-epimerase. An
epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose according to
the present invention can be identified by biochemical and functional characteristics as well
as structural characteristics. For example, an epimerase according to the present invention
is capable of acting on GDP-D-mannose as a substrate, and more particularly, such an
epimerase is capable of catalyzing the conversion of GDP-D-mannose to GDP-L-galactose.
It is to be understood that such capabilities need not necessarily be the normal
or natural function of the epimerase as it acts in its endogenous (i.e., natural) environment.
For example, GDP-4-keto-6-deoxy-D-mannose epimerase/reductase, in its natural
environment under normal conditions, catalyzes the conversion of
GDP-4-keto-6-deoxy-D-mannose to GDP-L-fucose and does not act directly on GDP-D-mannose
(see Figs. 8A, B); however,
such an epimerase is encompassed by the present invention for use in catalyzing the
conversion of GDP-D-mannose to GDP-L-galactose for production of ascorbic acid, to
the extent that it is capable of, or can be modified to be capable of, catalyzing the
conversion of GDP-D-mannose to GDP-L-galactose. Therefore, the present invention
includes epimerases which have the desired enzyme activity for use in production of
ascorbic acid, are capable of having such desired enzyme activity, and/or are capable of
being modified or induced to have such desired enzyme activity.

In one embodiment, an epimerase according to the present invention includes an 
epimerase that catalyzes the reaction depicted in Fig. 7. In another embodiment, an 
epimerase according to the present invention includes an epimerase that catalyzes the first
of the reactions depicted in Fig. 8B. In one embodiment, an epimerase according to the
present invention binds to NADPH. In another embodiment, an epimerase according to
the present invention is NADPH-dependent for enzyme activity. 

As discussed above, the present inventors have discovered that a key enzyme in 
L-ascorbic acid biosynthesis in plants and microorganisms is
GDP-D-mannose:GDP-L-galactose epimerase (refer to Figs. 1A-1C). One embodiment of the invention described
herein is directed to the manipulation of this enzyme and structural homologues of this
enzyme to increase L-ascorbic acid production in genetically engineered plants and/or
microorganisms. More particularly, the GDP-D-mannose:GDP-L-galactose epimerase of
the L-ascorbic acid pathway and GDP-4-keto-6-deoxy-D-mannose epimerase/reductases
are believed to be structurally homologous at both the sequence and tertiary structure
level; a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase is believed to be capable
of functioning in the L-ascorbic acid biosynthetic pathway, and a
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or homologue thereof may be superior to a
GDP-D-mannose:GDP-L-galactose epimerase for increasing L-ascorbic acid production in
genetically engineered plants and/or microorganisms. Furthermore, the present inventors
disclose the use of a nucleotide sequence encoding all or part of a
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase as a probe to identify the gene encoding
GDP-D-mannose:GDP-L-galactose epimerase. Similarly, the present inventors disclose the use
of a nucleotide sequence of the gene encoding GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase to design oligonucleotide primers for use in a PCR-based strategy for
identifying and cloning a gene encoding GDP-D-mannose:GDP-L-galactose epimerase. 

Without being bound by theory, the present inventors believe that the following 
evidence supports the novel concept that the GDP-D-mannose:GDP-L-galactose 
epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductases have significant
structural homology at the level of sequence and/or tertiary structure, and that the
GDP-4-keto-6-deoxy-D-mannose epimerase/reductases and/or homologues thereof would be
useful for production of ascorbic acid and/or for isolating the endogenous GDP-D- 
mannose:GDP-L-galactose epimerase. 

Although prior to the present invention, it was not known that the
GDP-D-mannose:GDP-L-galactose epimerase enzyme (also known as GDP-D-mannose-3,5-epimerase)
plays a critical role in L-ascorbic acid biosynthesis, this enzyme was previously
described to catalyze the overall reversible reaction between GDP-D-mannose and
GDP-L-galactose (Barber, 1971, Arch. Biochem. Biophys. 147:619-623; Barber, 1975, Arch.
Biochem. Biophys. 167:718-722; Barber, 1979, J. Biol. Chem. 254:7600-7603; Hebda,
et al., 1979, Arch. Biochem. Biophys. 194:496-502; Barber and Hebda, 1982, Meth.
Enzymol., 83:522-525). Despite these studies, GDP-D-mannose:GDP-L-galactose
epimerase has never been well characterized nor has the gene encoding this enzyme been
cloned and sequenced. Since the original work by Barber, GDP-D-mannose:GDP-L-galactose
epimerase activity has been detected in the colorless microalga Prototheca
moriformis by the assignee of the present application, and in Arabidopsis thaliana and pea
embryonic axes (Wheeler, et al., 1998, ibid).

Barber (1979, J. Biol. Chem. 254:7600-7603) proposed a mechanism for
GDP-D-mannose:GDP-L-galactose epimerase partially purified from the green microalga
Chlorella pyrenoidosa. The overall conversion of GDP-D-mannose to GDP-L-galactose
was proposed to proceed by oxidation of the hexosyl moiety at C-4 to a keto intermediate,
ene-diol formation, and inversion of the configurations at C-3 and C-5 upon rehydration
of the double bonds and stereospecific reduction of the keto group. The proposed 
mechanism is depicted in Fig. 7. 

Based on Barber's work, Feingold and Avigad (1980, In The Biochemistry of
Plants, Vol. 3: Carbohydrates; Structure and Function, P.K. Stumpf and E.E. Conn, eds.,
Academic Press, NY) elaborated further on the proposed mechanism for
GDP-D-mannose:GDP-L-galactose epimerase. This mechanism is based on the assumption that
the epimerase contains tightly bound NAD+, and transfer of a hydride ion from C-4 of the
substrate (GDP-D-mannose) to enzyme-associated NAD+ converts the enzyme to the
reduced (NADH) form, generating enzyme-bound GDP-4-keto-D-mannose. The latter
would then undergo epimerization by an ene-diol mechanism. The final product
(GDP-L-galactose) would be released from the enzyme after stereospecific transfer of the hydride
ion originally removed from C-4, simultaneously regenerating the oxidized form of the 
enzyme. 

L-fucose (6-deoxy-L-galactose) is a component of bacterial lipopolysaccharides, 
mammalian and plant glycoproteins and polysaccharides of plant cell walls. L-fucose is
synthesized de novo from GDP-D-mannose by the sequential action of
GDP-D-mannose-4,6-dehydratase (an NAD(P)-dependent enzyme), and a bifunctional
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase (NADPH-dependent), also referred to in scientific
literature as GDP-fucose synthetase (Rizzi, et al., 1998, Structure 6:1453-1465; Somers,
et al., 1998, Structure 6:1601-1612). This pathway for L-fucose biosynthesis appears to
be ubiquitous (Rizzi, et al., 1998, Structure 6:1453-1465). The mechanisms for
GDP-D-mannose-4,6-dehydratase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductase are
shown in Fig. 8A, B (Chang, et al., 1988, J. Biol. Chem. 263:1693-1697; Barber, 1980, 
Plant Physiol. 66:326-329). 

Comparison of Figs. 7 and 8A, B reveals that Barber's proposed mechanism for 
GDP-D-mannose:GDP-L-galactose epimerase is analogous to the reaction mechanism for
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase. The same mechanism has also
been demonstrated for the epimerization reaction that occurs in the biosynthesis of two
TDP-6-deoxy hexoses, TDP-L-rhamnose and TDP-6-deoxy-L-talose, from TDP-D-glucose
(Liu and Thorson, 1994, Ann. Rev. Microbiol. 48:223-256). In the latter cases,
however, the final reduction at C-4 is catalyzed by NADPH-dependent reductases that are
separate from the epimerase enzyme. These reductases have opposite stereospecificity, 
providing either TDP-L-rhamnose or TDP-6-deoxy-L-talose (Liu and Thorson, 1994, 
Ann. Rev. Microbiol. 48:223-256). 

In all of the mechanisms described above, NAD(P)H is required for the final 
reduction at C-4 (refer to Fig. 8B). In the work of Hebda, et al. (1979, Arch. Biochem.
Biophys. 194:496-502), it was reported that GDP-D-mannose:GDP-L-galactose
epimerase from C. pyrenoidosa did not require NAD, NADP or NADH for activity.
Strangely, NADPH was not tested. Based on the analogous mechanisms shown in Figs.
7 and 8A, B, the present inventors believe that it is likely that
GDP-D-mannose:GDP-L-galactose epimerase from C. pyrenoidosa requires NADPH for the final reduction step.
Why activity was detected in vitro without NADPH addition is not known, but tight
binding of NADPH to the enzyme could explain this observation. On the other hand, if
the proposed mechanism of Feingold and Avigad (1980, in The Biochemistry of Plants,
Vol. 3, p. 101-170: Carbohydrates; Structure and Function, P.K. Stumpf and E.E. Conn,
eds., Academic Press, NY) is correct, the reduced enzyme-bound cofactor generated in the
first oxidation step of the epimerase reaction would serve as the source of electrons for
the final reduction of the keto group at C-4 back to the alcohol. Thus no addition of
exogenous reduced cofactor would be required for activity in vitro. 

Recently, a human gene encoding the bifunctional GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase was cloned and sequenced (Tonetti, et al., 1996, J. Biol.
Chem. 271:27274-27279). The amino acid sequence of the human
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase shows significant homology (29% identity) to the E. coli
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase (Tonetti, et al., 1998, Acta Cryst.
D54:684-686; Somers, et al., 1998, Structure 6:1601-1612, both of which are
incorporated herein by reference in their entireties). Tonetti et al. and Somers et al.
additionally disclosed the tertiary (three dimensional) structure of the E. coli
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase (also known as GDP-fucose synthetase), and
noted significant structural homology with another epimerase, UDP-galactose 4-epimerase
(GalE). These epimerases also share significant homology at the sequence level. Since
no gene encoding a GDP-D-mannose:GDP-L-galactose epimerase has been cloned and
sequenced, homology with genes encoding GDP-4-keto-6-deoxy-D-mannose
epimerase/reductases or with genes encoding a UDP-galactose 4-epimerase has not been
demonstrated. However, based on the similarity of the reaction products for
GDP-D-mannose:GDP-L-galactose epimerase and GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase (i.e., GDP-L-galactose and GDP-6-deoxy-L-galactose [i.e., GDP-L-fucose],
respectively) and the common catalytic mechanisms (Figs. 7 and 8A, B), the present
inventors believe that the genes encoding the enzymes will have a high degree of sequence
homology, as well as tertiary structural homology.
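
By way of illustration only, a sequence-level comparison of the kind summarized above as
"percent identity" can be sketched as follows. The sketch assumes that the two sequences
have already been aligned and that gaps are written as "-"; it uses one common convention
(identical residues divided by columns in which neither sequence has a gap), which is an
assumption and not necessarily the convention behind the 29% figure cited above. The
function name is hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over alignment columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == gap or res_b == gap:
            continue  # gapped columns are excluded from the denominator
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared
```

Other conventions divide by the full alignment length or by the length of the shorter
sequence, so identity values are comparable only when the same convention is used.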

Significant structural homology between GDP-D-mannose:GDP-L-galactose
epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductases may allow a GDP-
4-keto-6-deoxy-D-mannose epimerase/reductase, or a homologue thereof, to function in
the L-ascorbic acid biosynthetic pathway, and a GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase could potentially be even better than a GDP-D-mannose:GDP-L-
galactose epimerase for increasing L-ascorbic acid production in genetically engineered
plants and/or microorganisms. Furthermore, a nucleotide sequence encoding all or part
of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase can be used as a probe to
identify the gene encoding GDP-D-mannose:GDP-L-galactose epimerase. Likewise, the
nucleotide sequence of the gene encoding GDP-4-keto-6-deoxy-D-mannose epimerase/
reductase can be used to design oligonucleotide primers for use in a PCR-based strategy
for identifying and cloning a gene encoding GDP-D-mannose:GDP-L-galactose
epimerase.

The ability to substitute GDP-4-keto-6-deoxy-D-mannose epimerase/reductase for GDP-
D-mannose:GDP-L-galactose epimerase to enhance L-ascorbic acid biosynthesis in plants
or microorganisms depends on the ability of GDP-4-keto-6-deoxy-D-mannose epimerase/
reductase to act directly on GDP-D-mannose to form GDP-L-galactose. Evidence
supporting this possibility already exists. Arabidopsis thaliana mur1 mutants are defective
in GDP-D-mannose-4,6-dehydratase activity (Bonin, et al., 1997, Proc. Natl. Acad. Sci.
94:2085-2090). These mutants are thus blocked in GDP-L-fucose biosynthesis, and
consequently have less than 2% of the normal amounts of L-fucose in the primary cell
walls of aerial portions of the plant (Zablackis, et al., 1996, Science 272:1808-1810). The
mur1 mutants are more brittle than wild-type plants, are slightly dwarfed and have an
apparently normal life cycle (Zablackis, et al., 1996, Science 272:1808-1810). When mur1
mutants are grown in the presence of exogenous L-fucose, the L-fucose content in the plant
is restored to the wild-type state (Bonin, et al., 1997, Proc. Natl. Acad. Sci. 94:2085-2090).
It was discovered (Zablackis, et al., 1996, Science 272:1808-1810) that mur1 mutants
contain, in the hemicellulose xyloglucan component of the primary cell wall, L-galactose
in place of the normal L-fucose. L-galactose is not normally found in the xyloglucan
component, but in mur1 mutants L-galactose partly replaces the terminal L-fucosyl residue.
Bonin, et al. (1997, Proc. Natl. Acad. Sci. 94:2085-2090) hypothesized that in the absence
of a functional GDP-D-mannose-4,6-dehydratase in the mur1 mutants, the GDP-4-keto-6-
deoxy-D-mannose epimerase/reductase normally involved in L-fucose synthesis may be
able to use GDP-D-mannose directly, forming GDP-L-galactose. Another possibility,
however, is that the enzymes involved in L-ascorbic acid biosynthesis in A. thaliana are
responsible for forming GDP-L-galactose in the mur1 mutant. If this were true, it would
suggest that in the wild-type plant, some mechanism exists that prevents GDP-L-galactose
formed in the L-ascorbic acid pathway from entering cell wall biosynthesis and
substituting for (competing with) GDP-L-fucose for incorporation into the xyloglucan
component (since L-galactose is not present in the primary cell wall of the wild-type
plant).

Because of the similar reaction mechanisms of GDP-D-mannose:GDP-L-galactose
epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductase, and because of the
evidence that GDP-4-keto-6-deoxy-D-mannose epimerase/reductase can act directly on
GDP-D-mannose to form GDP-L-galactose, the present inventors believe that genes
encoding all epimerases and epimerase/reductases that act on GDP-D-mannose have high
homology. As such, one aspect of the present invention relates to the use of any
epimerase (and nucleic acid sequences encoding such epimerase) having significant
homology (at the primary, secondary and/or tertiary structure level) to a GDP-4-keto-6-
deoxy-D-mannose epimerase/reductase or to a UDP-galactose 4-epimerase for the
purpose of improving the biosynthetic production of L-ascorbic acid.

Therefore, as described above, one embodiment of the present invention relates
to a method for producing ascorbic acid or esters thereof in a microorganism, which
includes culturing a microorganism having a genetic modification to increase the action
of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose. Also
included in the present invention are genetically modified microorganisms and plants in
which the genetic modification increases the action of an epimerase that catalyzes
conversion of GDP-D-mannose to GDP-L-galactose.

According to the present invention, an increase in the action of the GDP-D-
mannose:GDP-L-galactose epimerase in the L-ascorbic acid production pathway can be
achieved by genetic modifications which include, but are not limited to, overexpression of
the GDP-D-mannose:GDP-L-galactose epimerase gene, a homologue of such gene, or of
any recombinant nucleic acid sequence encoding an epimerase that is homologous in
primary (nucleic acid or amino acid sequence) or tertiary (three dimensional protein)
structure to a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose
4-epimerase, such as by overexpression of a recombinant nucleic acid molecule encoding
the epimerase gene or a homologue thereof, and/or by mutation of the endogenous or
recombinant gene to enhance expression of the gene.

According to the present invention, an epimerase that has a tertiary structure that
is homologous to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/
reductase is an epimerase that has a tertiary structure that substantially conforms to the
tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented
by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws
(Table 12). In another embodiment, an epimerase that has a tertiary structure that is
homologous to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/
reductase is an epimerase that has a tertiary structure that substantially conforms to the
tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented
by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1GFS.
As used herein, a "tertiary structure" or "three dimensional structure" of a protein, such
terms being interchangeable, refers to the components and the manner of arrangement of
the components in three dimensional space to constitute the protein. The use of the term
"substantially conforms" refers to at least a portion of a tertiary structure of an epimerase
which is sufficiently spatially similar to at least a portion of a specified three dimensional
configuration of a particular set of atomic coordinates (e.g., those represented by
Brookhaven Protein Data Bank Accession Code 1bws) to allow the tertiary structure of
at least said portion of the epimerase to be modeled or calculated (i.e., by molecular
replacement) using the particular set of atomic coordinates as a basis for estimating the
atomic coordinates defining the three dimensional configuration of the epimerase.

More particularly, a tertiary structure that substantially conforms to a given set of
atomic coordinates is a structure having an average root-mean-square deviation (RMSD)
of less than about 2.5 Å, and more preferably, less than about 2 Å, and, in increasing
preference, less than about 1.5 Å, less than about 1 Å, less than about 0.5 Å, and most
preferably, less than about 0.3 Å, over at least about 25% of the Cα positions as
compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/
reductase represented by the atomic coordinates having Brookhaven Protein Data Bank
Accession Code 1bws. In other embodiments, a structure that substantially conforms to
a given set of atomic coordinates is a structure wherein such structure has the recited
average root-mean-square deviation (RMSD) value over at least about 50% of the Cα
positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase represented by the atomic coordinates having Brookhaven Protein
Data Bank Accession Code 1bws, and in another embodiment, such structure has the
recited average root-mean-square deviation (RMSD) value over at least about 75% of the
Cα positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase represented by the atomic coordinates having Brookhaven Protein
Data Bank Accession Code 1bws, and in another embodiment, such structure has the
recited average root-mean-square deviation (RMSD) value over about 100% of the Cα
positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase represented by the atomic coordinates having Brookhaven Protein
Data Bank Accession Code 1bws. Methods to calculate RMSD values are well known
in the art. Various software programs for determining the tertiary structural homology
between two or more proteins are known in the art and are publicly available, such as
QUANTA (Molecular Simulations Inc.).
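
The RMSD criterion described above can be illustrated with a minimal sketch. It is not the method used by QUANTA or any other named package. It assumes that the two structures have already been superimposed and that the Cα atoms have already been paired; the function names, the `total_ca` argument and the example coordinates are hypothetical and are introduced here only for illustration.

```python
import math


def ca_rmsd(coords_a, coords_b):
    """RMSD (in angstroms) between paired C-alpha coordinates.

    Assumption: both lists hold (x, y, z) tuples for the same paired
    positions, and the structures are already superimposed.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty, equal-length coordinate lists")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


def substantially_conforms(coords_a, coords_b, total_ca,
                           max_rmsd=2.5, min_fraction=0.25):
    """Apply the two-part test: RMSD below max_rmsd over at least
    min_fraction of the reference C-alpha positions (total_ca)."""
    coverage = len(coords_a) / total_ca
    return coverage >= min_fraction and ca_rmsd(coords_a, coords_b) < max_rmsd


# Hypothetical coordinates: four paired C-alpha atoms offset by 1 angstrom in y.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0), (3.0, 0.0, 0.0)]
model = [(0.0, 1.0, 0.0), (1.0, 1.0, 0.0), (2.0, 1.0, 0.0), (3.0, 1.0, 0.0)]
print(ca_rmsd(model, reference))
print(substantially_conforms(model, reference, total_ca=8))
```

The defaults correspond to the broadest thresholds recited above (2.5 Å over 25% of the Cα positions); the narrower embodiments can be tested by passing smaller `max_rmsd` or larger `min_fraction` values.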

A preferred epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-
galactose according to the method and genetically modified organisms of the present
invention includes an epimerase that comprises a substrate binding site having a tertiary
structure that substantially conforms to the tertiary structure of the substrate binding site
of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic
coordinates having Brookhaven Protein Data Bank Accession Code 1bws. Preferably, the
tertiary structure of the substrate binding site of the epimerase has an average root-mean-
square deviation (RMSD) of less than about 2.5 Å, and more preferably, less than about
2 Å, and, in increasing preference, less than about 1.5 Å, less than about 1 Å, less than
about 0.5 Å, and most preferably, less than about 0.3 Å, over at least about 25% of the
Cα positions as compared to the tertiary structure of the substrate binding site of a GDP-
4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates
having Brookhaven Protein Data Bank Accession Code 1bws. In other embodiments, the
tertiary structure of the substrate binding site of the epimerase has the recited average
root-mean-square deviation (RMSD) value over at least about 50% of the Cα positions
as compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6-
deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having
Brookhaven Protein Data Bank Accession Code 1bws, and in another embodiment, the
tertiary structure of the substrate binding site of the epimerase has the recited average
root-mean-square deviation (RMSD) value over at least about 75% of the Cα positions
as compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6-
deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having
Brookhaven Protein Data Bank Accession Code 1bws, and in another embodiment, the
tertiary structure of the substrate binding site of the epimerase has the recited average
root-mean-square deviation (RMSD) value over about 100% of the Cα positions as
compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6-deoxy-
D-mannose epimerase/reductase represented by the atomic coordinates having
Brookhaven Protein Data Bank Accession Code 1bws. The tertiary structure of the
substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase
represented by the atomic coordinates having Brookhaven Protein Data Bank Accession
Code 1bws is discussed in detail in Rizzi et al., 1998, ibid. Additionally, the tertiary
structure of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/
reductase represented by the atomic coordinates having Brookhaven Protein Data Bank
Accession Code 1GFS is discussed in detail in Somers et al., 1998, ibid.

Another preferred epimerase according to the present invention includes an
epimerase that comprises a catalytic site having a tertiary structure that substantially
conforms to the tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D-
mannose epimerase/reductase represented by the atomic coordinates having Brookhaven
Protein Data Bank Accession Code 1bws. Preferably, the tertiary structure of the
catalytic site of the epimerase has an average root-mean-square deviation (RMSD) of less
than about 2.5 Å, and more preferably, less than about 2 Å, and, in increasing preference,
less than about 1.5 Å, less than about 1 Å, less than about 0.5 Å, and most preferably, less
than about 0.3 Å, over at least about 25% of the Cα positions as compared to the tertiary
structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase
represented by the atomic coordinates having Brookhaven Protein Data Bank Accession
Code 1bws. In other embodiments, the tertiary structure of the catalytic site of the
epimerase has the recited average root-mean-square deviation (RMSD) value over at least
about 50% of the Cα positions as compared to the tertiary structure of the catalytic site
of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic
coordinates having Brookhaven Protein Data Bank Accession Code 1bws, and in another
embodiment, the tertiary structure of the catalytic site of the epimerase has the recited
average root-mean-square deviation (RMSD) value over at least about 75% of the Cα
positions as compared to the tertiary structure of the catalytic site of a GDP-4-keto-6-
deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having
Brookhaven Protein Data Bank Accession Code 1bws, and in another embodiment, the
tertiary structure of the catalytic site of the epimerase has the recited average root-mean-
square deviation (RMSD) value over 100% of the Cα positions as compared to the
tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/
reductase represented by the atomic coordinates having Brookhaven Protein Data Bank
Accession Code 1bws.

In one embodiment, an epimerase encompassed by the present invention includes
an epimerase that has a catalytic site which includes amino acid residues: serine, tyrosine
and lysine. In a preferred embodiment, the tertiary structure positions of the amino acid
residues serine, tyrosine and lysine substantially conform to the tertiary structure position
of residues Ser107, Tyr136 and Lys140, respectively, as represented by atomic
coordinates in Brookhaven Protein Data Bank Accession Code 1bws. The tertiary
structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase
represented by the atomic coordinates having Brookhaven Protein Data Bank Accession
Code 1bws is discussed in detail in Rizzi et al., 1998, ibid. Additionally, the tertiary
structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase
represented by the atomic coordinates having Brookhaven Protein Data Bank Accession
Code 1GFS is discussed in detail in Somers et al., 1998, ibid.

In an even more preferred embodiment, the above definition of "substantially
conforms" can be further defined to include atoms of amino acid side chains. As used
herein, the phrase "common amino acid side chains" refers to amino acid side chains that
are common to both the structure which substantially conforms to a given set of atomic
coordinates and the structure that is actually represented by such atomic coordinates.
Preferably, a tertiary structure that substantially conforms to a given set of atomic
coordinates is a structure having an average root-mean-square deviation (RMSD) of less
than about 2.5 Å, and more preferably, less than about 2 Å, and, in increasing preference,
less than about 1.5 Å, less than about 1 Å, less than about 0.5 Å, and most preferably, less
than about 0.3 Å over at least about 25% of the common amino acid side chains as
compared to the tertiary structure represented by the given set of atomic coordinates. In
another embodiment, a structure that substantially conforms to a given set of atomic
coordinates is a structure having the recited average root-mean-square deviation (RMSD)
value over at least about 50% of the common amino acid side chains as compared to the
tertiary structure represented by the given set of atomic coordinates, and in another
embodiment, such structure has the recited average root-mean-square deviation (RMSD)
value over at least about 75% of the common amino acid side chains as compared to the
tertiary structure represented by the given set of atomic coordinates, and in another
embodiment, such a structure has the recited average root-mean-square deviation
(RMSD) value over 100% of the common amino acid side chains as compared to the
tertiary structure represented by the given set of atomic coordinates.

A tertiary structure of an epimerase which substantially conforms to a specified set
of atomic coordinates can be modeled by a suitable modeling computer program such as
MODELER (A. Sali and T.L. Blundell, J. Mol. Biol., vol. 234:779-815, 1993 as
implemented in the Insight II Homology software package (Insight II (97.0), MSI, San
Diego)), using information, for example, derived from the following data: (1) the amino
acid sequence of the epimerase; (2) the amino acid sequence of the related portion(s) of
the protein represented by the specified set of atomic coordinates having a three
dimensional configuration; and, (3) the atomic coordinates of the specified three
dimensional configuration. Alternatively, a tertiary structure of an epimerase which
substantially conforms to a specified set of atomic coordinates can be modeled using data
generated from analysis of a crystallized structure of the epimerase. A tertiary structure
of an epimerase which substantially conforms to a specified set of atomic coordinates can
also be calculated by a method such as molecular replacement. Methods of molecular
replacement are generally known by those of skill in the art (generally described in
Brunger, Meth. Enzym., vol. 276, pp. 558-580, 1997; Navaza and Saludjian, Meth.
Enzym., vol. 276, pp. 581-594, 1997; Tong and Rossmann, Meth. Enzym., vol. 276, pp.
594-611, 1997; and Bentley, Meth. Enzym., vol. 276, pp. 611-619, 1997, each of which
is incorporated by this reference herein in its entirety) and are performed in a software
program including, for example, XPLOR (Brunger, et al., Science, vol. 235, p. 458,
1987). In addition, a structure can be modeled using techniques generally described by,
for example, Sali, Current Opinion in Biotechnology, vol. 6, pp. 437-451, 1995, and
algorithms can be implemented in program packages such as Homology 95.0 (in the
program Insight II, available from Biosym/MSI, San Diego, CA). Use of Homology 95.0
requires an alignment of an amino acid sequence of a known structure having a known
three dimensional structure with an amino acid sequence of a target structure to be
modeled. The alignment can be a pairwise alignment or a multiple sequence alignment
including other related sequences (for example, using the method generally described by
Rost, Meth. Enzymol., vol. 266, pp. 525-539, 1996) to improve accuracy. Structurally
conserved regions can be identified by comparing related structural features, or by
examining the degree of sequence homology between the known structure and the target
structure. Certain coordinates for the target structure are assigned using known structures
from the known structure. Coordinates for other regions of the target structure can be
generated from fragments obtained from known structures such as those found in the
Protein Data Bank maintained by Brookhaven National Laboratory, Upton, NY.
Conformation of side chains of the target structure can be assigned with reference to what
is sterically allowable and using a library of rotamers and their frequency of occurrence
(as generally described in Ponder and Richards, J. Mol. Biol., vol. 193, pp. 775-791,
1987). The resulting model of the target structure can be refined by molecular mechanics
(such as embodied in the program Discover, available from Biosym/MSI) to ensure that
the model is chemically and conformationally reasonable.

According to the present invention, an epimerase that has a nucleic acid sequence
that is homologous at the primary structure level (i.e., is a homologue of) to a nucleic acid
sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-
galactose 4-epimerase includes any epimerase encoded by a nucleic acid sequence that is
at least about 15%, and preferably at least about 20%, and more preferably at least about
25%, and even more preferably, at least about 30% identical to a nucleic acid sequence
encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4-
epimerase, and preferably to a nucleic acid sequence selected from the group consisting
of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 or SEQ ID NO:9.
Similarly, an epimerase that has an amino acid sequence that is homologous to an amino
acid sequence of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-
galactose 4-epimerase includes any epimerase having an amino acid sequence that is at
least about 15%, and preferably at least about 20%, and more preferably at least about
25%, and even more preferably, at least about 30% identical to an amino acid sequence
of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4-
epimerase, and preferably to an amino acid sequence selected from the group consisting
of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 or SEQ ID NO:10.

According to one embodiment of the present invention, homology or percent
identity between two or more nucleic acid or amino acid sequences is determined using
methods known in the art for aligning and/or calculating percentage identity. To compare
the homology/percent identity between two or more sequences as set forth above, for
example, a module contained within DNASTAR (DNASTAR, Inc., Madison, Wisconsin)
can be used. In particular, to calculate the percent identity between two nucleic acid or
amino acid sequences, the Lipman-Pearson method, provided by the MegAlign module
within the DNASTAR program, is preferably used, with the following parameters, also
referred to herein as the Lipman-Pearson standard default parameters:

(1) Ktuple = 2;
(2) Gap penalty = 4;
(3) Gap length penalty = 12.

Using the Lipman-Pearson method with these parameters, for example, the percent
identity between the amino acid sequence for E. coli GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase (SEQ ID NO:4) and human GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase (FX) (SEQ ID NO:6) is 27.7%, which is comparable to the 27%
identity described for these enzymes in Tonetti et al., 1998, Acta Cryst. D54:684-686.
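
Once an alignment has been produced, the percent identity figure itself is simple arithmetic, as the following minimal sketch suggests. It does not reproduce the Lipman-Pearson method or the MegAlign module. It assumes that the two sequences have already been aligned and padded with "-" gap characters, and it assumes one particular convention (identical columns divided by columns in which neither sequence has a gap); alignment programs may use a different denominator. The function names and example strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over the columns where neither sequence has a gap.

    Assumption: the inputs are two rows of an existing pairwise alignment,
    so they must have the same length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # gapped columns are not counted under this convention
        compared += 1
        if res_a == res_b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def meets_identity_threshold(aligned_a, aligned_b, threshold=15.0):
    """Check an alignment against a percent identity floor; 15% is the
    broadest threshold recited for sequence homologues in this section."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical aligned fragments, not taken from any SEQ ID NO.
print(percent_identity("MKT-AIL", "MRTGA-L"))
```

Passing `threshold=20.0`, `25.0` or `30.0` corresponds to the progressively preferred identity levels.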

According to another embodiment of the present invention, to align two or more
nucleic acid or amino acid sequences, for example to generate a consensus sequence or
evaluate the similarity at various positions between such sequences, a CLUSTAL
alignment program (e.g., CLUSTAL, CLUSTAL V, CLUSTAL W), also available as a
module within the DNASTAR program, can be used with the following parameters, also
referred to herein as the CLUSTAL standard default parameters:

Multiple Alignment Parameters (i.e., for more than 2 sequences):

(1) Gap penalty = 10;
(2) Gap length penalty = 10;

Pairwise Alignment Parameters (i.e., for two sequences):

(1) Ktuple = 1;
(2) Gap penalty = 3;
(3) Window = 5;
(4) Diagonals saved = 5.

According to the present invention, a GDP-4-keto-6-deoxy-D-mannose epimerase/
reductase can be a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase from any
organism, including Arabidopsis thaliana, Escherichia coli, and human. A nucleic acid
sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase from
Arabidopsis thaliana is represented herein by SEQ ID NO:1. SEQ ID NO:1 encodes a
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase having an amino acid sequence
represented herein as SEQ ID NO:2. A nucleic acid sequence encoding a GDP-4-keto-6-
deoxy-D-mannose epimerase/reductase from Escherichia coli is represented herein by
SEQ ID NO:3. SEQ ID NO:3 encodes a GDP-4-keto-6-deoxy-D-mannose epimerase/
reductase having an amino acid sequence represented herein as SEQ ID NO:4. A nucleic
acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase from
Homo sapiens is represented herein by SEQ ID NO:5. SEQ ID NO:5 encodes a GDP-4-
keto-6-deoxy-D-mannose epimerase/reductase having an amino acid sequence represented
herein as SEQ ID NO:6.

According to the present invention, a UDP-galactose 4-epimerase can be a UDP-
galactose 4-epimerase from any organism, including Escherichia coli and human. A
nucleic acid sequence encoding a UDP-galactose 4-epimerase from Escherichia coli is
represented herein by SEQ ID NO:7. SEQ ID NO:7 encodes a UDP-galactose 4-
epimerase having an amino acid sequence represented herein as SEQ ID NO:8. A nucleic
acid sequence encoding a UDP-galactose 4-epimerase from Homo sapiens is represented
herein by SEQ ID NO:9. SEQ ID NO:9 encodes a UDP-galactose 4-epimerase having
an amino acid sequence represented herein as SEQ ID NO:10.

In a preferred embodiment, an epimerase encompassed by the present invention
has an amino acid sequence that aligns with the amino acid sequence of SEQ ID NO:11,
for example using a CLUSTAL alignment program, wherein amino acid residues in the
amino acid sequence of the epimerase align with 100% identity with at least about 50%
of non-Xaa residues in SEQ ID NO:11, and preferably at least about 75% of non-Xaa
residues in SEQ ID NO:11, and more preferably, at least about 90% of non-Xaa residues
in SEQ ID NO:11, and even more preferably 100% of non-Xaa residues in SEQ ID
NO:11. The percent identity of residues aligning with 100% identity with non-Xaa
residues can be simply calculated by dividing the number of 100% identical matches at
non-Xaa residues in SEQ ID NO:11 by the total number of non-Xaa residues in SEQ ID
NO:11. A preferred nucleic acid sequence encoding an epimerase encompassed by the
present invention includes a nucleic acid sequence encoding an epimerase having an amino
acid sequence with the above described identity to SEQ ID NO:11. Such an alignment
using a CLUSTAL alignment program is based on the same parameters as previously
disclosed herein. SEQ ID NO:11 represents a consensus amino acid sequence of an
epimerase which was derived by aligning at least portions of amino acid sequences SEQ
ID NO:4, SEQ ID NO:6 and SEQ ID NO:8, as described in Somers et al., 1998, Structure
6:1601-1612, and can be approximately duplicated using CLUSTAL.
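
The division described above (identical matches at non-Xaa consensus positions over the total number of non-Xaa consensus positions) can be sketched as follows. This is an illustration only. It assumes the candidate sequence has already been aligned to the consensus by an alignment program, that Xaa is written as "X" and gaps as "-" in one-letter code, and that the example strings are hypothetical stand-ins rather than SEQ ID NO:11 itself.

```python
def consensus_identity(aligned_candidate, aligned_consensus, xaa="X", gap="-"):
    """Percent of non-Xaa consensus positions matched exactly by the candidate.

    Assumption: both strings are rows of the same alignment (equal length).
    Consensus columns holding Xaa or a gap are excluded from the denominator.
    """
    if len(aligned_candidate) != len(aligned_consensus):
        raise ValueError("aligned sequences must have equal length")
    total = 0
    matches = 0
    for cand, cons in zip(aligned_candidate, aligned_consensus):
        if cons == xaa or cons == gap:
            continue
        total += 1
        if cand == cons:
            matches += 1
    if total == 0:
        raise ValueError("consensus has no non-Xaa positions")
    return 100.0 * matches / total


# Hypothetical consensus fragment with four defined (non-Xaa) positions.
consensus = "GXXGXXGS"
candidate = "GATGFIGA"
print(consensus_identity(candidate, consensus))
```

The result can then be compared against the 50%, 75%, 90% or 100% levels recited above.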

In another embodiment, an epimerase encompassed by the present invention
includes an epimerase that has a catalytic site which includes amino acid residues: serine,
tyrosine and lysine. Preferably, such serine, tyrosine and lysine residues are located at
positions in the epimerase amino acid sequence which align using a CLUSTAL alignment
program with positions Ser105, Tyr134 and Lys138 of consensus sequence SEQ ID
NO:11, with positions Ser109, Tyr138 and Lys142 of sequence SEQ ID NO:2, with
positions Ser107, Tyr136 and Lys140 of SEQ ID NO:4, with positions Ser114, Tyr143
and Lys147 of sequence SEQ ID NO:6, with positions Ser124, Tyr149 and Lys153 of
sequence SEQ ID NO:8 or with positions Ser132, Tyr157 and Lys161 of sequence SEQ
ID NO:10.

In another embodiment, an epimerase that has an amino acid sequence that is
homologous to an amino acid sequence of a GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase includes any epimerase that has an amino acid motif: Gly-Xaa-Xaa-
Gly-Xaa-Xaa-Gly, which is found, for example in positions 8 through 14 of the consensus
sequence SEQ ID NO:11, in positions 12 through 18 of SEQ ID NO:2, in positions 10
through 16 of SEQ ID NO:4, in positions 14 through 20 of SEQ ID NO:6, in positions
7 through 13 of SEQ ID NO:8, and in positions 9 through 15 of SEQ ID NO:10. Such
a motif can be identified by its alignment with the same motif in the above-identified amino
acid sequences using a CLUSTAL alignment program. Preferably, such motif is located
within the first 25 N-terminal amino acids of the amino acid sequence of the epimerase.
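
As a simple complement to alignment-based identification, the Gly-Xaa-Xaa-Gly-Xaa-Xaa-Gly motif can also be located by direct pattern matching on a one-letter amino acid sequence. The sketch below is an illustration under stated assumptions, not the CLUSTAL procedure described above: the function names are hypothetical, Xaa is taken to mean any single residue letter, and the example sequence is invented for demonstration and is not one of the SEQ ID NOs.

```python
import re

# Lookahead so that overlapping occurrences are all reported.
# G = glycine; [A-Z]{2} stands for the two Xaa (any residue) positions.
MOTIF = re.compile(r"(?=G[A-Z]{2}G[A-Z]{2}G)")


def motif_positions(sequence):
    """Return 1-based (start, end) positions of every G-X-X-G-X-X-G motif."""
    seq = sequence.upper()
    return [(m.start() + 1, m.start() + 7) for m in MOTIF.finditer(seq)]


def has_n_terminal_motif(sequence, window=25):
    """True if a complete motif lies within the first `window` residues."""
    return any(end <= window for _, end in motif_positions(sequence))


# Hypothetical 25-residue N-terminal fragment with the motif at 10 through 16.
example = "MTTRKLLVTGAAGFIGSHLVEALLA"
print(motif_positions(example))
print(has_n_terminal_motif(example))
```

Positions are reported 1-based so that they can be compared directly with the residue ranges quoted in the text.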

In yet another embodiment, an epimerase encompassed by the present invention
includes an epimerase that has a substrate binding site which includes amino acid residues
that align using a CLUSTAL alignment program with at least 50% of amino acid
positions Asn177, Ser178, Arg187, Arg209, Lys283, Asn165, Ser107, Ser108, Cys109,
Asn133, Tyr136 and His179 of SEQ ID NO:4. Alignment with positions Ser107, Tyr136,
Asn165 and Arg209 is preferably with 100% identity (i.e., exact match of residue, under
parameters for alignment).

In another embodiment of the present invention, an epimerase encompassed by the
present invention comprises at least 4 contiguous amino acid residues having 100%
identity with at least 4 contiguous amino acid residues of an amino acid sequence selected
from the group of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 or SEQ
ID NO:10, as determined using a Lipman-Pearson method with Lipman-Pearson standard
default parameters or by comparing an alignment using a CLUSTAL program with
CLUSTAL standard default parameters. According to the present invention, the term
"contiguous" means to be connected in an unbroken sequence. For a first sequence to
have "100% identity" with a second sequence means that the first sequence exactly
matches the second sequence with no gaps between nucleotides or amino acids.

In another embodiment of the present invention, an epimerase encompassed by the
present invention is encoded by a nucleic acid sequence that comprises at least 12
contiguous nucleic acid residues having 100% identity with at least 12 contiguous nucleic
acid residues of a nucleic acid sequence selected from the group of SEQ ID NO:1, SEQ
ID NO:3, SEQ ID NO:5, SEQ ID NO:7 or SEQ ID NO:10, as determined using a
Lipman-Pearson method with Lipman-Pearson standard default parameters or by
comparing an alignment using a CLUSTAL program with CLUSTAL standard default
parameters.
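
The "contiguous residues with 100% identity" criterion of the two preceding embodiments amounts to asking whether two sequences share an exact, ungapped run of a given length (4 for amino acids, 12 for nucleotides). The sketch below checks this directly by comparing fixed-length substrings. It is offered as an illustration and is a different technique from the Lipman-Pearson or CLUSTAL comparison named in the text; the function name and the example sequences are hypothetical.

```python
def shares_contiguous_run(query, reference, k):
    """True if `query` and `reference` share an exact, ungapped run of k residues.

    Works for one-letter amino acid strings (e.g. k=4) and for nucleotide
    strings (e.g. k=12). Comparison is case-insensitive.
    """
    if k <= 0:
        raise ValueError("k must be a positive integer")
    query = query.upper()
    reference = reference.upper()
    # Collect every length-k substring of the reference once.
    reference_runs = {reference[i:i + k] for i in range(len(reference) - k + 1)}
    return any(query[i:i + k] in reference_runs
               for i in range(len(query) - k + 1))


# Hypothetical fragments, not taken from any SEQ ID NO.
print(shares_contiguous_run("AAGHRGTT", "MSGHRGMV", 4))
print(shares_contiguous_run("TTTACGTACGTACGT", "GGACGTACGTACGTCC", 12))
```

Building a set of reference substrings keeps the check linear in the combined sequence lengths, which is convenient when a query is screened against several reference sequences in turn.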

In another embodiment of the present invention, an epimerase encompassed by the
present invention is encoded by a nucleic acid sequence that hybridizes under stringent
hybridization conditions to a nucleic acid sequence selected from the group of SEQ ID
NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 or SEQ ID NO:9. As used herein,
stringent hybridization conditions refer to standard hybridization conditions under which
nucleic acid molecules are used to identify similar nucleic acid molecules. Such standard
conditions are disclosed, for example, in Sambrook et al., Molecular Cloning: A
Laboratory Manual, Cold Spring Harbor Labs Press, 1989. Sambrook et al., ibid., is
incorporated by reference herein in its entirety (see specifically, pages 9.31-9.62). In
addition, formulae to calculate the appropriate hybridization and wash conditions to
achieve hybridization permitting varying degrees of mismatch of nucleotides are disclosed,
for example, in Meinkoth et al., 1984, Anal. Biochem. 138, 267-284; Meinkoth et al.,
ibid., is incorporated by reference herein in its entirety.

More particularly, stringent hybridization and washing conditions, as referred to
herein, refer to conditions which permit isolation of nucleic acid molecules having at least
about 70% nucleic acid sequence identity with the nucleic acid molecule being used to
probe in the hybridization reaction, more particularly at least about 75%, and most
particularly at least about 80%. Such conditions will vary, depending on whether
DNA:RNA or DNA:DNA hybrids are being formed. Calculated melting temperatures for
DNA:DNA hybrids are 10°C less than for DNA:RNA hybrids. In particular
embodiments, stringent hybridization conditions for DNA:DNA hybrids include
hybridization at an ionic strength of 6X SSC (0.9 M Na+) at a temperature of between
about 20°C and about 35°C, more preferably, between about 28°C and about 40°C, and
even more preferably, between about 35°C and about 45°C. In particular embodiments,
stringent hybridization conditions for DNA:RNA hybrids include hybridization at an ionic
strength of 6X SSC (0.9 M Na+) at a temperature of between about 30°C and about
45°C, more preferably, between about 38°C and about 50°C, and even more preferably,
between about 45°C and about 55°C. These values are based on calculations of a melting
temperature for molecules larger than about 100 nucleotides, 0% formamide and a G +
C content of about 40%. Alternatively, Tm can be calculated empirically as set forth in
Sambrook et al., supra, pages 9.31 to 9.62.
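
As a rough illustration of the kind of melting temperature calculation referred to above, the following sketch uses a commonly cited form of the DNA:DNA equation associated with Meinkoth and Wahl. The function names are hypothetical, and the mismatch correction of about 1°C per 1% mismatch is a rule of thumb assumed here for illustration; the sketch is not asserted to reproduce the temperature ranges recited above.

```python
import math


def tm_dna_dna(na_molar, gc_percent, formamide_percent, length_nt):
    """Estimate Tm (deg C) of a perfectly matched DNA:DNA hybrid:
    Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 0.61*(%formamide) - 500/L
    """
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length_nt)


def tm_with_mismatch(tm_perfect, percent_identity, drop_per_percent=1.0):
    """Lower Tm by an assumed fixed amount per 1% of sequence mismatch."""
    return tm_perfect - drop_per_percent * (100.0 - percent_identity)


# Conditions named in the text: 6X SSC (0.9 M Na+), 40% G + C, 0% formamide,
# and a hybrid of about 100 nucleotides.
tm = tm_dna_dna(na_molar=0.9, gc_percent=40.0, formamide_percent=0.0, length_nt=100)
print(round(tm, 1))
for identity in (70, 75, 80):
    print(identity, round(tm_with_mismatch(tm, identity), 1))
```

Under these inputs the perfectly matched estimate is roughly 92°C; hybridization is then typically carried out some margin below the mismatch-adjusted value, and that margin is a choice left to the practitioner.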

In another embodiment of the present invention, an epimerase encompassed by the
present invention is encoded by a nucleic acid sequence that comprises a nucleic acid
sequence selected from the group of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ
ID NO:7 or SEQ ID NO:9 or a fragment thereof wherein the fragment encodes a protein
that is capable of catalyzing the conversion of GDP-D-mannose to GDP-L-galactose, such
as under physiological conditions. In another embodiment, an epimerase encompassed by
the present invention comprises an amino acid sequence selected from the group of SEQ
ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10 or a fragment
thereof wherein the fragment is capable of catalyzing the conversion of GDP-D-mannose
to GDP-L-galactose. It is to be understood that the nucleic acid sequence encoding the
amino acid sequences identified herein can vary due to degeneracies. As used herein,
nucleotide degeneracy refers to the phenomenon that one amino acid can be encoded by
different nucleotide codons.
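
Nucleotide degeneracy can be shown with a brief sketch that translates two different nucleic acid sequences into the same amino acid sequence. The sketch assumes the standard genetic code; the function name and the two nine-nucleotide sequences are hypothetical and are not taken from any SEQ ID NO.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate a DNA coding sequence, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Two different nucleotide sequences that encode the same tripeptide.
print(translate("ATGCTGAGC"))
print(translate("ATGTTATCT"))
```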

One embodiment of the present invention relates to a method to identify an
epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose. Preferably,
such a method is useful for identifying the GDP-D-mannose:GDP-L-galactose epimerase
which catalyzes the conversion of GDP-D-mannose to GDP-L-galactose in the
endogenous (i.e., naturally occurring) L-ascorbic acid biosynthetic pathway of
microorganisms and/or plants. Such a method can include the steps of: (a) contacting a
source of nucleic acid molecules with an oligonucleotide at least about 12 nucleotides in
length under stringent hybridization conditions, wherein the oligonucleotide is identified
by its ability to hybridize under stringent hybridization conditions to a nucleic acid
sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:3 and SEQ
ID NO:5; and, (b) identifying nucleic acid molecules from the source of nucleic acid
molecules which hybridize under stringent hybridization conditions to the oligonucleotide.
Nucleic acid molecules identified by this method can then be isolated from the source
using standard molecular biology techniques. Preferably, the source of nucleic acid
molecules is obtained from a microorganism or plant that has an ascorbic acid production
pathway. Such a source of nucleic acid molecules can be any source of nucleic acid
molecules which can be isolated from an organism and/or which can be screened by
hybridization with an oligonucleotide such as a probe or a PCR primer. Such sources
include genomic and cDNA libraries and isolated RNA.

In order to screen cDNA libraries from different organisms and to isolate nucleic
acid molecules encoding enzymes such as the GDP-D-mannose:GDP-L-galactose
epimerase and related epimerases, one can use any of a variety of standard molecular and
biochemical techniques. For example, oligonucleotide primers, preferably degenerate
primers, can be designed using the most conserved regions of a GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase nucleic acid sequence, and such primers can be used in a
polymerase chain reaction (PCR) protocol to amplify the same or related epimerases,
including the GDP-D-mannose:GDP-L-galactose epimerase from the ascorbic acid
pathway, from nucleic acids (e.g., genomic or cDNA libraries) isolated from a desired
organism (e.g., a microorganism or plant having an L-ascorbic acid pathway). Similarly,
oligonucleotide probes can be designed using the most conserved regions of a
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase nucleic acid sequence and such probes can
be used to identify and isolate nucleic acid molecules, such as from a genomic or cDNA
library, that hybridize under conditions of low, moderate, or high stringency with the
probe.

Alternatively, the GDP-D-mannose:GDP-L-galactose epimerase can be purified
from an organism such as Prototheca, the N-terminal amino acid sequence can be
determined (including the sequence of internal peptide fragments), and this information
can be used to design degenerate primers for amplifying a gene fragment from the
organism's cDNA. This fragment would then be used to probe the cDNA library, and
subsequently fragments that hybridize to the probe would be cloned in that organism or
another suitable production organism. There is ample precedent for plant enzymes being
expressed in an active form in bacteria, such as E. coli. Alternatively, yeast are also a
suitable candidate for developing a heterologous system for L-ascorbic acid production.
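
When degenerate primers are designed from a determined peptide sequence, one practical consideration is how many distinct oligonucleotides the primer pool must contain. The following sketch, assuming the standard genetic code, scans a peptide for the stretch with the smallest pool. The function names and the eight-residue peptide are hypothetical; the window of 4 residues corresponds to an oligonucleotide of 12 nucleotides.

```python
from collections import Counter
from math import prod

# Standard genetic code as a 64-letter string (codon order T, C, A, G);
# counting letters gives the number of codons for each amino acid.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS_PER_RESIDUE = Counter(AMINO_ACIDS)


def primer_degeneracy(peptide):
    """Number of distinct nucleotide sequences that can encode `peptide`."""
    peptide = peptide.upper()
    for residue in peptide:
        if residue not in CODONS_PER_RESIDUE or residue == "*":
            raise ValueError(f"unsupported residue: {residue!r}")
    return prod(CODONS_PER_RESIDUE[residue] for residue in peptide)


def least_degenerate_window(peptide, window):
    """Return (start, subpeptide, degeneracy) for the window of `window`
    residues that needs the smallest primer pool (first one on ties)."""
    if window <= 0 or window > len(peptide):
        raise ValueError("window must be between 1 and the peptide length")
    start = min(range(len(peptide) - window + 1),
                key=lambda i: primer_degeneracy(peptide[i:i + window]))
    sub = peptide[start:start + window]
    return start, sub, primer_degeneracy(sub)


# Hypothetical peptide fragment, for illustration only.
print(least_degenerate_window("MLSWMDKF", 4))
```

Stretches rich in methionine and tryptophan, each encoded by a single codon, give the smallest pools, which is why such regions are often favored for primer design.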

As discussed above in general for increasing the action of an enzyme in the
L-ascorbic acid pathway according to the present invention, in one embodiment of the
present invention, the action of an epimerase that catalyzes the conversion of
GDP-D-mannose to GDP-L-galactose is increased by amplification of the expression (i.e.,
overexpression) of such an epimerase. Overexpression of an epimerase can be
accomplished, for example, by introduction of a recombinant nucleic acid molecule
encoding the epimerase. It is preferred that the gene encoding an epimerase according to
the present invention be cloned under control of an artificial promoter. The promoter can
be any suitable promoter that will provide a level of epimerase expression required to
maintain a sufficient level of L-ascorbic acid in the production organism. Preferred
promoters are constitutive (rather than inducible) promoters, since the need for addition
of expensive inducers is therefore obviated. The gene dosage (copy number) of a
recombinant nucleic acid molecule according to the present invention can be varied
according to the requirements for maximum product formation. In one embodiment, the
recombinant nucleic acid molecule encoding an epimerase according to the present
invention is integrated into the chromosome of the microorganism.

It is another embodiment of the present invention to provide a microorganism
having one or more epimerases according to the present invention with improved affinity
for its substrate. An epimerase with improved affinity for its substrate can be produced
by any suitable method of genetic modification or protein engineering. For example,
computer-based protein engineering can be used to design an epimerase protein with
greater stability and better affinity for its substrate. See for example, Maulik et al., 1997,
Molecular Biotechnology: Therapeutic Applications and Strategies, Wiley-Liss, Inc.,
which is incorporated herein by reference in its entirety.

As noted above, in the method for production of L-ascorbic acid of the present
invention, a microorganism having a genetically modified L-ascorbic acid production
pathway is cultured in a fermentation medium for production of L-ascorbic acid. An
appropriate, or effective, fermentation medium refers to any medium in which a genetically
modified microorganism of the present invention, when cultured, is capable of producing
L-ascorbic acid. Such a medium is typically an aqueous medium comprising assimilable
carbon, nitrogen and phosphate sources. Such a medium can also include appropriate
salts, minerals, metals and other nutrients. One advantage of genetically modifying a
microorganism as described herein is that although such genetic modifications can
significantly alter the production of L-ascorbic acid, they can be designed such that they
do not create any nutritional requirements for the production organism. Thus, a
minimal-salts medium containing glucose as the sole carbon source can be used as the fermentation
medium. The use of a minimal-salts-glucose medium for the L-ascorbic acid fermentation
will also facilitate recovery and purification of the L-ascorbic acid product.

In one mode of operation of the present invention, the carbon source
concentration, such as the glucose concentration, of the fermentation medium is
monitored during fermentation. Glucose concentration of the fermentation medium can
be monitored using known techniques, such as, for example, use of the glucose oxidase
enzyme test or high pressure liquid chromatography, which can be used to monitor
glucose concentration in the supernatant, e.g., a cell-free component of the fermentation
medium. As stated previously, the carbon source concentration should be kept below the
level at which cell growth inhibition occurs. Although such concentration may vary from
organism to organism, for glucose as a carbon source, cell growth inhibition occurs at
glucose concentrations greater than about 60 g/L, and can be determined readily by
trial. Accordingly, when glucose is used as a carbon source the glucose concentration in
the fermentation medium is maintained in the range of from about 1 g/L to about 100 g/L,
more preferably in the range of from about 2 g/L to about 50 g/L, and yet more preferably
in the range of from about 5 g/L to about 20 g/L. Although the carbon source
concentration can be maintained within desired levels by addition of, for example, a
substantially pure glucose solution, it is preferred to maintain the carbon source
concentration of the fermentation medium by addition of aliquots of the original
fermentation medium. The use of aliquots of the original fermentation medium is
desirable because the concentrations of other nutrients in the medium (e.g., the nitrogen
and phosphate sources) can be maintained simultaneously. Likewise, the trace metals
concentrations can be maintained in the fermentation medium by addition of aliquots of
the trace metals solution.
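
The size of an addition needed to return the glucose concentration to a chosen level follows from a simple mass balance. The sketch below is illustrative only: it assumes a well-mixed vessel, ignores glucose consumed while the addition is made, and uses hypothetical names and numbers (a 10 L broth, a 500 g/L feed) that do not come from the present disclosure.

```python
def feed_volume_l(broth_volume_l, glucose_now_g_per_l,
                  target_g_per_l, feed_g_per_l):
    """Volume of feed (L) that raises the broth glucose to the target.

    Mass balance: (V*C + v*Cf) / (V + v) = Ct
    which gives   v = V * (Ct - C) / (Cf - Ct)
    """
    if feed_g_per_l <= target_g_per_l:
        raise ValueError("feed must be more concentrated than the target")
    if glucose_now_g_per_l >= target_g_per_l:
        return 0.0  # already at or above the target; no addition needed
    return (broth_volume_l * (target_g_per_l - glucose_now_g_per_l)
            / (feed_g_per_l - target_g_per_l))


# Hypothetical example: glucose has fallen to 5 g/L and is restored to 20 g/L.
v = feed_volume_l(broth_volume_l=10.0, glucose_now_g_per_l=5.0,
                  target_g_per_l=20.0, feed_g_per_l=500.0)
print(v)
```

The same balance applies whether the feed is a substantially pure glucose solution or an aliquot of fermentation medium; only the feed concentration changes.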

In an embodiment of the fermentation process of the present invention, a
fermentation medium is prepared as described above. This fermentation medium is
inoculated with an actively growing culture of genetically modified microorganisms of the
present invention in an amount sufficient to produce, after a reasonable growth period, a
high cell density. Typical inoculation cell densities are within the range of from about
0.1 g/L to about 15 g/L, preferably from about 0.5 g/L to about 10 g/L and more preferably
from about 1 g/L to about 5 g/L, based on the dry weight of the cells. The cells are then
grown to a cell density in the range of from about 10 g/L to about 100 g/L, preferably from
about 20 g/L to about 80 g/L, and more preferably from about 50 g/L to about 70 g/L. The
residence times for the microorganisms to reach the desired cell densities during
fermentation are typically less than about 200 hours, preferably less than about 120 hours,
and more preferably less than about 96 hours.
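
The relationship between inoculation density, final density and residence time can be estimated with a simple growth model. The sketch below assumes unrestricted exponential growth at a constant specific growth rate; that model, the function name and the rate of 0.05 per hour are assumptions made for illustration and are not values from the present disclosure.

```python
import math


def hours_to_density(start_g_per_l, final_g_per_l, mu_per_h):
    """Time for X(t) = X0 * exp(mu * t) to grow from start to final density."""
    if start_g_per_l <= 0 or final_g_per_l <= 0 or mu_per_h <= 0:
        raise ValueError("densities and growth rate must be positive")
    return math.log(final_g_per_l / start_g_per_l) / mu_per_h


# Hypothetical run: 1 g/L inoculum grown to 50 g/L dry cell weight.
t = hours_to_density(start_g_per_l=1.0, final_g_per_l=50.0, mu_per_h=0.05)
print(round(t, 1), "hours")
```

Real cultures slow down as nutrients or oxygen become limiting, so an estimate of this kind is a lower bound rather than a prediction.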
The microorganisms useful in the method of the present invention can be cultured
in conventional fermentation modes, which include, but are not limited to, batch,
fed-batch, and continuous. It is preferred, however, that the fermentation be carried out in
fed-batch mode. In such a case, during fermentation some of the components of the
medium are depleted. It is possible to initiate fermentation with relatively high
concentrations of such components so that growth is supported for a period of time before
additions are required. The preferred ranges of these components are maintained
throughout the fermentation by making additions as levels are depleted by fermentation.
Levels of components in the fermentation medium can be monitored by, for example,
sampling the fermentation medium periodically and assaying for concentrations.
Alternatively, once a standard fermentation procedure is developed, additions can be made
at timed intervals corresponding to known levels at particular times throughout the
fermentation. As will be recognized by those in the art, the rate of consumption of
nutrient increases during fermentation as the cell density of the medium increases.
Moreover, to avoid introduction of foreign microorganisms into the fermentation medium,
addition is performed using aseptic addition methods, as are known in the art. In addition,
a small amount of anti-foaming agent may be added during the fermentation.

The present inventors have determined that high levels of magnesium in the
fermentation medium inhibit the production of L-ascorbic acid due to repression of
enzymes early in the production pathway, although enzymes late in the pathway (i.e., from
L-galactose to L-ascorbic acid) are not negatively affected (See Examples). Therefore,
in a preferred embodiment of the method of the present invention, the step of culturing
is carried out in a fermentation medium that is magnesium (Mg2+) limited. Even more
preferably, the fermentation is magnesium limited during the cell growth phase.
Preferably, the fermentation medium comprises less than about 0.5 g/L of Mg2+ during the
cell growth phase of fermentation, and even more preferably, less than about 0.2 g/L of
Mg2+, and even more preferably, less than about 0.1 g/L of Mg2+.

The temperature of the fermentation medium can be any temperature suitable for
growth and ascorbic acid production, and may be modified according to the growth
requirements of the production microorganism used. For example, prior to inoculation
of the fermentation medium with an inoculum, the fermentation medium can be brought
to and maintained at a temperature in the range of from about 20°C to about 45°C,
preferably to a temperature in the range of from about 25°C to about 40°C, and more
preferably in the range of from about 30°C to about 38°C.

It is a further embodiment of the present invention to supplement and/or control
other components and parameters of the fermentation medium, as necessary to maintain
and/or enhance the production of L-ascorbic acid by a production organism. For example,
in one embodiment, the pH of the fermentation medium is monitored for fluctuations in
pH. In the fermentation method of the present invention, the pH is preferably maintained
at a pH of from about pH 6.0 to about pH 8.0, and more preferably, at about pH 7.0. In
the method of the present invention, if the starting pH of the fermentation medium is pH
7.0, the pH of the fermentation medium is monitored for significant variations from pH
7.0, and is adjusted accordingly, for example, by the addition of sodium hydroxide. In a
preferred embodiment of the present invention, genetically modified microorganisms
useful for production of L-ascorbic acid include acid-tolerant microorganisms. Such
microorganisms include, for example, microalgae of the genera Prototheca and Chlorella
(See U.S. Patent No. 5,792,631, ibid., and U.S. Patent No. 5,900,370, ibid.).

The production of ascorbic acid by culturing acid-tolerant microorganisms
provides significant advantages over known ascorbic acid production methods. One such
advantage is that such organisms are acidophilic, allowing fermentation to be carried out
under low pH conditions, with the fermentation medium pH typically less than about 6.
Below this pH, extracellular ascorbic acid produced by the microorganism during
fermentation is relatively stable because the rate of oxidation of ascorbic acid in the
fermentation medium by oxygen is reduced. Accordingly, high productivity levels can be
obtained for producing L-ascorbic acid with acid-tolerant microorganisms according to
the methods of the present invention. In addition, control of the dissolved oxygen content
to very low levels to avoid oxidation of ascorbic acid is unnecessary. Moreover, this
advantage allows for the use of continuous recovery methods because extracellular
medium can be treated to recover the ascorbic acid product.

Thus, the present method can be conducted at low pH when acid-tolerant
microorganisms are used as production organisms. The benefit of this process is that at
low pH, extracellular ascorbic acid produced by the organism is degraded at a lower
rate than it would be if the fermentation medium were at a higher pH. For example, prior
to inoculation of the fermentation medium with an inoculum, the pH of the fermentation
medium can be adjusted, and further monitored during fermentation. Typically, the pH of
the fermentation medium is brought to and maintained below about 6, preferably below
about 5.5, and more preferably below about 5. The pH of the fermentation medium can be
controlled by the addition of ammonia to the fermentation medium. In such cases when
ammonia is used to control pH, it also conveniently serves as a nitrogen source in the
fermentation medium.

The fermentation medium can also be maintained to have a dissolved oxygen
content during the course of fermentation to maintain cell growth and to maintain cell
metabolism for L-ascorbic acid formation. The oxygen concentration of the fermentation
medium can be monitored using known methods, such as through the use of an oxygen
probe electrode. Oxygen can be added to the fermentation medium using methods known
in the art, for example, through agitation and aeration of the medium by stirring or
shaking. Preferably, the oxygen concentration in the fermentation medium is in the range
of from about 20% to about 100% of the saturation value of oxygen in the medium based
upon the solubility of oxygen in the fermentation medium at atmospheric pressure and at
a temperature in the range of from about 30°C to about 40°C. Periodic drops in the
oxygen concentration below this range may occur during fermentation, however, without
adversely affecting the fermentation.
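
The preferred operating ranges described above for an acid-tolerant production organism can be gathered into a simple monitoring check. The sketch below is one possible arrangement, not a prescribed control scheme: the class, constant and function names are hypothetical, and the limits are the more preferred glucose and temperature ranges, the growth-phase magnesium limit of about 0.5 g/L, a pH below about 6, and dissolved oxygen of about 20% to 100% of saturation.

```python
from dataclasses import dataclass


@dataclass
class Reading:
    glucose_g_per_l: float
    mg_g_per_l: float
    temperature_c: float
    ph: float
    dissolved_o2_pct_sat: float


# Inclusive (low, high) limits taken from the ranges recited in the text.
RANGES = {
    "glucose_g_per_l": (5.0, 20.0),
    "temperature_c": (30.0, 38.0),
    "dissolved_o2_pct_sat": (20.0, 100.0),
}
MG_GROWTH_PHASE_MAX_G_PER_L = 0.5   # Mg2+ limit during the cell growth phase
PH_MAX_ACID_TOLERANT = 6.0          # low-pH mode for acid-tolerant organisms


def out_of_range(reading, growth_phase=True):
    """Return the names of parameters that fall outside the limits above."""
    flagged = [name for name, (low, high) in RANGES.items()
               if not low <= getattr(reading, name) <= high]
    if growth_phase and reading.mg_g_per_l >= MG_GROWTH_PHASE_MAX_G_PER_L:
        flagged.append("mg_g_per_l")
    if reading.ph >= PH_MAX_ACID_TOLERANT:
        flagged.append("ph")
    return flagged


# Hypothetical sample readings.
print(out_of_range(Reading(10.0, 0.1, 35.0, 5.0, 60.0)))
print(out_of_range(Reading(10.0, 0.8, 35.0, 6.5, 15.0)))
```

Because periodic drops in oxygen concentration are stated to be tolerable, a flagged dissolved oxygen value would reasonably be treated as a prompt for review rather than as a fault.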

The genetically modified microorganisms of the present invention are engineered
to produce significant quantities of extracellular L-ascorbic acid. Extracellular L-ascorbic
acid can be recovered from the fermentation medium using conventional separation and
purification techniques. For example, the fermentation medium can be filtered or
centrifuged to remove microorganisms, cell debris and other particulate matter, and
L-ascorbic acid can be recovered from the cell-free supernate by conventional methods, such
as, for example, ion exchange, chromatography, extraction, solvent extraction, membrane
separation, electrodialysis, reverse osmosis, distillation, chemical derivatization and
crystallization.

One such example of L-ascorbic acid recovery is provided in U.S. Patent No.
4,595,659 by Cayle, incorporated herein in its entirety by reference, which discloses the
isolation of L-ascorbic acid from an aqueous fermentation medium by ion exchange resin
adsorption and elution, which is followed by decoloration, evaporation and crystallization.
Further, isolation of the structurally similar isoascorbic acid from fermentation medium
by a continuous multi-bed extraction system of anion-exchange resins is described by K.
Shimizu, Agr. Biol. Chem. 31:346-353 (1967), which is incorporated herein in its entirety
by reference.

Intracellular L-ascorbic acid produced in accordance with the present invention
can also be recovered and used in a variety of applications. For example, cells from the
microorganisms can be lysed and the ascorbic acid which is released can be recovered by
a variety of known techniques. Alternatively, intracellular ascorbic acid can be recovered
by washing the cells to extract the ascorbic acid, such as through diafiltration.

Development of a microorganism with enhanced ability to produce L-ascorbic acid
by genetic modification can be accomplished using both classical strain development and
molecular genetic techniques, and particularly, recombinant technology (genetic
engineering). In general, the strategy for creating a microorganism with enhanced
L-ascorbic acid production is to (1) inactivate or delete at least one, and preferably more
than one of the competing or inhibitory pathways in which production of L-ascorbic acid
is negatively affected (e.g., inhibited), and more significantly to (2) amplify the L-ascorbic
acid production pathway by increasing the action of a gene(s) encoding an enzyme(s)
involved in the pathway.

In one embodiment, the strategy for creating a microorganism with enhanced
L-ascorbic acid production is to amplify the L-ascorbic acid production pathway by
increasing the action of GDP-D-mannose:GDP-L-galactose epimerase, as discussed
above. Such strategy includes genetically modifying the endogenous
GDP-D-mannose:GDP-L-galactose epimerase such that L-ascorbic acid production is increased,
and/or expressing/overexpressing a recombinant epimerase that catalyzes the conversion
of GDP-D-mannose to GDP-L-galactose, which includes expression of recombinant
GDP-D-mannose:GDP-L-galactose epimerase and/or homologues thereof, and of other
recombinant epimerases such as GDP-4-keto-6-deoxy-D-mannose epimerase/reductase
and epimerases that share structural homology with such epimerase as discussed in detail
above.

It is to be understood that a production organism can be genetically modified by
recombinant technology in which a nucleic acid molecule encoding a protein involved in
the L-ascorbic acid production pathway disclosed herein is transformed into a suitable
host which is a different member of the plant kingdom from which the nucleic acid
molecule was derived. For example, it is an embodiment of the present invention that a
recombinant nucleic acid molecule encoding a GDP-D-mannose:GDP-L-galactose
epimerase from a higher plant can be transformed into a microalgal host in order to
overexpress the epimerase and enhance production of L-ascorbic acid in the microalgal
production organism.

As previously discussed herein, in one embodiment, a genetically modified
microorganism can be a microorganism in which nucleic acid molecules have been deleted,
inserted or modified, such as by insertion, deletion, substitution, and/or inversion of
nucleotides, in such a manner that such modifications provide the desired effect within the
microorganism. A genetically modified microorganism is preferably modified by
recombinant technology, such as by introduction of an isolated nucleic acid molecule into
a microorganism. For example, a genetically modified microorganism can be transfected
with a recombinant nucleic acid molecule encoding a protein of interest, such as a protein
for which increased expression is desired. The transfected nucleic acid molecule can
remain extrachromosomal or can integrate into one or more sites within a chromosome
of the transfected (i.e., recombinant) host cell in such a manner that its ability to be
expressed is retained. Preferably, once a host cell of the present invention is transfected
with a nucleic acid molecule, the nucleic acid molecule is integrated into the host cell
genome. A significant advantage of integration is that the nucleic acid molecule is stably
maintained in the cell. In a preferred embodiment, the integrated nucleic acid molecule
is operatively linked to a transcription control sequence (described below) which can be
induced to control expression of the nucleic acid molecule.

A nucleic acid molecule can be integrated into the genome of the host cell either
by random or targeted integration. Such methods of integration are known in the art. For
example, an E. coli strain ATCC 47002 contains mutations that confer upon it an inability
to maintain plasmids which contain a ColE1 origin of replication. When such plasmids are
transferred to this strain, selection for genetic markers contained on the plasmid results
in integration of the plasmid into the chromosome. This strain can be transformed, for
example, with plasmids containing the gene of interest and a selectable marker flanked
by the 5'- and 3'-termini of the E. coli lacZ gene. The lacZ sequences target the incoming
DNA to the lacZ gene contained in the chromosome. Integration at the lacZ locus
replaces the intact lacZ gene, which encodes the enzyme β-galactosidase, with a partial
lacZ gene interrupted by the gene of interest. Successful integrants can be selected for
β-galactosidase negativity.

A genetically modified microorganism can also be produced by introducing nucleic
acid molecules into a recipient cell genome by a method such as by using a transducing
bacteriophage. The use of recombinant technology and transducing bacteriophage
technology to produce several different genetically modified microorganisms of the present
invention is known in the art.

According to the present invention, a gene, for example the
GDP-D-mannose:GDP-L-galactose epimerase gene, includes all nucleic acid sequences related to
a natural epimerase gene such as regulatory regions that control production of the
epimerase protein encoded by that gene (such as, but not limited to, transcription,
translation or post-translation control regions) as well as the coding region itself. In
another embodiment, a gene, for example the GDP-D-mannose:GDP-L-galactose
epimerase gene, can be an allelic variant that includes a similar but not identical sequence
to the nucleic acid sequence encoding a given GDP-D-mannose:GDP-L-galactose
epimerase gene. An allelic variant of a GDP-D-mannose:GDP-L-galactose epimerase
gene which has a given nucleic acid sequence is a gene that occurs at essentially the same
locus (or loci) in the genome as the gene having the given nucleic acid sequence, but
which, due to natural variations caused by, for example, mutation or recombination, has
a similar but not identical sequence. Allelic variants typically encode proteins having
similar activity to that of the protein encoded by the gene to which they are being
compared. Allelic variants can also comprise alterations in the 5' or 3' untranslated
regions of the gene (e.g., in regulatory control regions). Allelic variants are well known
to those skilled in the art and would be expected to be found within a given
microorganism or plant and/or among a group of two or more microorganisms or plants.

In accordance with the present invention, an isolated nucleic acid molecule is a
nucleic acid molecule that has been removed from its natural milieu (i.e., that has been
subject to human manipulation). As such, "isolated" does not reflect the extent to which
the nucleic acid molecule has been purified. An isolated nucleic acid molecule can include
DNA, RNA, or derivatives of either DNA or RNA. There is no limit, other than a
practical limit, on the maximal size of a nucleic acid molecule in that the nucleic acid
molecule can include a portion of a gene, an entire gene, or multiple genes, or portions
thereof.

An isolated nucleic acid molecule of the present invention can be obtained from
its natural source either as an entire (i.e., complete) gene or a portion thereof capable of
forming a stable hybrid with that gene. An isolated nucleic acid molecule can also be
produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR)
amplification, cloning) or chemical synthesis. Isolated nucleic acid molecules include
natural nucleic acid molecules and homologues thereof, including, but not limited to,
natural allelic variants and modified nucleic acid molecules in which nucleotides have been
inserted, deleted, substituted, and/or inverted in such a manner that such modifications
provide the desired effect within the microorganism. A structural homologue of a nucleic
acid sequence has been described in detail above. Preferably, a homologue of a nucleic
acid sequence encodes a protein which has an amino acid sequence that is sufficiently
similar to the natural protein amino acid sequence that a nucleic acid sequence encoding
the homologue is capable of hybridizing under stringent conditions to (i.e., with) a nucleic
acid molecule encoding the natural protein (i.e., to the complement of the nucleic acid
strand encoding the natural protein amino acid sequence). A nucleic acid molecule
homologue encodes a protein homologue. As used herein, a homologue protein includes
proteins in which amino acids have been deleted (e.g., a truncated version of the protein,
such as a peptide), inserted, inverted, substituted and/or derivatized (e.g., by
glycosylation, phosphorylation, acetylation, myristoylation, prenylation, palmitation,
amidation and/or addition of glycosylphosphatidyl inositol) in such a manner that such
modifications provide the desired effect on the protein and/or within the microorganism
(e.g., increased or decreased action of the protein).

A nucleic acid molecule homologue can be produced using a number of methods 
5 known to those skilled in the art (see, for example, Sambrook et al., ibid). For example, 
nucleic acid molecules can be modified using a variety of techniques including, but not 
limited to, classic mutagenesis techniques and recombinant DNA techniques, such as site- 
directed mutagenesis, chemical treatment of a nucleic acid molecule to induce mutations, 
restriction enzyme cleavage of a nucleic acid fragment, ligation of nucleic acid fragments, 
PCR amplification and/or mutagenesis of selected regions of a nucleic acid sequence, 
synthesis of oligonucleotide mixtures and ligation of mixture groups to "build" a mixture 
of nucleic acid molecules and combinations thereof. Nucleic acid molecule homologues 
can be selected from a mixture of modified nucleic acids by screening for the function of 
the protein encoded by the nucleic acid and/or by hybridization with a wild-type gene.

Although the phrase "nucleic acid molecule" primarily refers to the physical nucleic
acid molecule and the phrase "nucleic acid sequence" primarily refers to the sequence of
nucleotides on the nucleic acid molecule, the two phrases can be used interchangeably, 
especially with respect to a nucleic acid molecule, or a nucleic acid sequence, being 
capable of encoding a gene involved in an L-ascorbic acid production pathway. 

Knowing the nucleic acid sequences of certain nucleic acid molecules of the 
present invention allows one skilled in the art to, for example, (a) make copies of those 
nucleic acid molecules and/or (b) obtain nucleic acid molecules including at least a portion 
of such nucleic acid molecules (e.g., nucleic acid molecules including full-length genes, 
full-length coding regions, regulatory control sequences, truncated coding regions). Such 
nucleic acid molecules can be obtained in a variety of ways including traditional cloning
techniques using oligonucleotide probes to screen appropriate libraries or DNA and PCR
amplification of appropriate libraries or DNA using oligonucleotide primers. Preferred
libraries to screen or from which to amplify nucleic acid molecules include bacterial and
yeast genomic DNA libraries, and in particular, microalgal genomic DNA libraries. 
Techniques to clone and amplify genes are disclosed, for example, in Sambrook et al.,
ibid.

The present invention includes a recombinant vector, which includes at least one
isolated nucleic acid molecule of the present invention, inserted into any vector capable 
of delivering the nucleic acid molecule into a host microorganism of the present invention. 
Such a vector can contain nucleic acid sequences that are not naturally found adjacent to 
the isolated nucleic acid molecules to be inserted into the vector. The vector can be either
RNA or DNA and typically is a plasmid. Recombinant vectors can be used in the cloning, 
sequencing, and/or otherwise manipulating of nucleic acid molecules. One type of 
recombinant vector, referred to herein as a recombinant molecule and described in more 
detail below, can be used in the expression of nucleic acid molecules. Preferred
recombinant vectors are capable of replicating in transformed bacterial cells, yeast cells,
and in particular, in microalgal cells. 

Transformation of a nucleic acid molecule into a cell can be accomplished by any 
method by which a nucleic acid molecule can be inserted into the cell. Transformation 
techniques include, but are not limited to, transfection, electroporation, microinjection and
biolistics.

A recombinant cell is preferably produced by transforming a host cell with one or 
more recombinant molecules, each comprising one or more nucleic acid molecules 
operatively linked to an expression vector containing one or more transcription control 
sequences. The phrase "operatively linked" refers to insertion of a nucleic acid molecule
into an expression vector in a manner such that the molecule is able to be expressed when
transformed into a host cell. As used herein, an expression vector is a DNA or RNA 
vector that is capable of transforming a host cell and of effecting expression of a specified 
nucleic acid molecule. Preferably, the expression vector is also capable of replicating 
within the host cell. In the present invention, expression vectors are typically plasmids.
Expression vectors of the present invention include any vectors that function (i.e., direct
gene expression) in a yeast host cell, a bacterial host cell, and preferably a microalgal host
cell. 

Nucleic acid molecules of the present invention can be operatively linked to 
expression vectors containing regulatory sequences such as transcription control 
sequences, translation control sequences, origins of replication, and other regulatory
sequences that are compatible with the recombinant cell and that control the expression
of nucleic acid molecules of the present invention. In particular, recombinant molecules
of the present invention include transcription control sequences. Transcription control 
sequences are sequences which control the initiation, elongation, and termination of 
transcription. Particularly important transcription control sequences are those which 
control transcription initiation, such as promoter, enhancer, operator and repressor
sequences. Suitable transcription control sequences include any transcription control 
sequence that can function in yeast or bacterial cells or preferably, in microalgal cells. A 
variety of such transcription control sequences are known to those skilled in the art. 

It may be appreciated by one skilled in the art that use of recombinant DNA
technologies can improve expression of transformed nucleic acid molecules by
manipulating, for example, the number of copies of the nucleic acid molecules within a 
host cell, the efficiency with which those nucleic acid molecules are transcribed, the 
efficiency with which the resultant transcripts are translated, and the efficiency of
post-translational modifications. Recombinant techniques useful for increasing the expression
of nucleic acid molecules of the present invention include, but are not limited to,
operatively linking nucleic acid molecules to high-copy number plasmids, integration of 
the nucleic acid molecules into the host cell chromosome, addition of vector stability 
sequences to plasmids, substitutions or modifications of transcription control signals (e.g., 
promoters, operators, enhancers), substitutions or modifications of translational control
signals, modification of nucleic acid molecules of the present invention to correspond to
the codon usage of the host cell, deletion of sequences that destabilize transcripts, and use 
of control signals that temporally separate recombinant cell growth from recombinant 
enzyme production during fermentation. The activity of an expressed recombinant protein 
of the present invention may be improved by fragmenting, modifying, or derivatizing
nucleic acid molecules encoding such a protein.

The following experimental results are provided for the purposes of illustration and 
are not intended to limit the scope of the invention. 

EXAMPLES

Example 1 

The present example describes the elucidation of the pathway from glucose to
L-ascorbic acid through GDP-D-mannose in plants.

Since the present inventors have previously shown that Prototheca makes
L-ascorbic acid (AA) from glucose, it was worthwhile to examine cultures for some of the
early conversion products of glucose. In the past, the present inventors had concentrated 
on pathways from glucose to organic acids, based on the published pathway of L-ascorbic 
acid synthesis in animals and proposed pathways in plants. The present inventors
demonstrate herein that the pathway from glucose to L-ascorbic acid involves not organic
acids, but rather sugar phosphates and nucleotide diphosphate sugars (NDP-sugars).

Prior to the present invention, it was known that all cells synthesize 
polysaccharides by first forming NDP-sugars. The sugar moiety is then incorporated into
polymer, while the cleaved NDP is recycled. A variety of polysaccharides are known, and
are usually named based on the relative proportions of the sugar residues in the polymers.
For example, a "galactomannan" contains mostly galactose, and to a lesser degree, 
mannose residues. The "biopolymer" from Prototheca strains isolated by the present 
inventors was analyzed and found to be 80% D-galactose, 18% rhamnose (D- or L- 
configuration not determined), and 2% L-arabinose. The present inventors provide
evidence herein of how the respective NDP-sugars that make up the Prototheca
biopolymer are formed, and what correlations exist between L-ascorbic acid synthesis and 
the formation of the NDP-sugar forms of the sugar residues found in the biopolymer. 

The common NDP-sugar UDP-glucose is shown in Fig. 2B. This is formed in
plants from glucose-1-P by the action of UDP-D-glucose pyrophosphorylase.
UDP-glucose can be epimerized in plants to form UDP-D-galactose, using
UDP-D-glucose-4-epimerase. UDP-D-galactose can also be formed by phosphorylation of D-galactose by
galactokinase to form D-galactose-1-P, which can be converted to UDP-D-galactose by
UDP-D-galactose pyrophosphorylase. These known routes were believed to account for
the D-galactose in the Prototheca biopolymer. The UDP-L-arabinose can be formed by
known reactions beginning with the oxidation of UDP-D-glucose to UDP-D-glucuronic
acid (by UDP-D-glucose dehydrogenase), decarboxylation to UDP-D-xylose, and 
epimerization to UDP-L-arabinose. This accounts for the arabinose residues in the 
biopolymer. UDP-L-rhamnose is known to be formed from UDP-D-glucose; thus all
three of the sugar moieties in the Prototheca biopolymer can be accounted for by a
pathway through glucose-1-P and UDP-glucose. Alternatively, if the rhamnose in the
biopolymer is D-rhamnose, it is not formed via UDP-D-glucose, but by oxidation of
GDP-D-mannose (see Fig. 1).

GDP-D-rhamnose is formed by converting glucose, in turn, to D-glucose-6-P,
D-fructose-6-P, D-mannose-6-P, D-mannose-1-P, GDP-D-mannose, and GDP-D-rhamnose.
It was of interest to the present inventors that this route passes through GDP-D-mannose.
Exogenous mannose is known to be converted to D-mannose-6-P in plants, and can enter
the path above. D-mannose is converted to L-ascorbic acid by Prototheca cells cultured
by the present inventors as well as or better than glucose (see Example 4). The mechanism
of conversion, in Chlorella pyrenoidosa, of GDP-D-mannose to GDP-L-galactose by
GDP-D-mannose:GDP-L-galactose epimerase, has been known for years (see Barber,
1971, Arch. Biochem. Biophys. 147:619-623, incorporated herein by reference in its
entirety). The present inventors have discovered herein that L-galactose and
L-galactono-γ-lactone are rapidly converted to L-ascorbic acid by strains of Prototheca and
Chlorella pyrenoidosa. Prior to the present invention, it was known that L-galactono-γ-lactone is
converted to L-ascorbic acid in several plant systems, but the synthesis steps prior to this
step were unknown. Based on the published literature and the present experimental
evidence, the present inventors have determined that the L-ascorbic acid biosynthetic
pathway in plants passes through GDP-D-mannose and involves sugar phosphates and
NDP-sugars. The proposed pathway is shown in Fig. 1. Salient points relevant to the
design and production of genetically modified microorganisms useful in the present
method include:

1. The enzymes leading from D-glucose to D-fructose-6-P are well-known
enzymes in the first, uncommitted steps of glycolysis.

2. The enzymes involved in the conversion of D-fructose-6-P to GDP-D-mannose
have been well characterized in plants, yeast, and bacteria, particularly
Azotobacter vinelandii and Pseudomonas aeruginosa, which convert GDP-D-mannose
to GDP-D-mannuronic acid, which is the precursor for alginate (see, for example,
Sa-Correia et al., 1987, J. Bacteriol. 169:3224-3231; Koplin et al., 1992, J. Bacteriol.
174:191-199; Oesterhelt et al., 1996, Plant Science 121:19-27; Feingold et al., 1980, The
Biochemistry of Plants: Vol. 3: Carbohydrates, structure and function, P.K. Stumpf & E.E.
Conn, eds., Academic Press, New York, pp. 101-170; Smith et al., 1992, Mol. Cell. Biol.
12:2924-2930; Boles et al., 1994, Eur. J. Biochem. 220:83-96; Hashimoto et al., 1997, J.
Biol. Chem. 272:16308-16314, all of which are incorporated herein by reference in their
entirety).

3. Barber (1971, supra, and 1975) identified in Chlorella pyrenoidosa the
enzyme activities for the conversion of GDP-D-mannose to GDP-L-galactose and
L-galactose-1-P.

4. The present inventors have shown herein the rapid conversion of
L-galactose and L-galactono-γ-lactone to L-ascorbic acid by Prototheca cells.

5. L-galactono-γ-lactone and L-galactonic acid can be interconverted in
solution by changing the pH of the solution; addition of base shifts the equilibrium to
L-galactonic acid, while addition of acid shifts the equilibrium to the lactone. Cells may have
an enzymatic means for this conversion in addition to this non-enzymatic route.

6. In plants, GDP-L-fucose is also formed from GDP-D-mannose, presumably
for incorporation into polysaccharide. Roberts (1971) fed labeled D-mannose to corn root
tips and found the label in polysaccharide, specifically in the residues of D-mannose,
L-galactose, and L-fucose. No label was detected in D-glucose, D-galactose, L-arabinose,
or D-xylose. Prototheca and C. pyrenoidosa cells have the ability to convert L-fucose
(6-deoxy-L-galactose) to a dipyridyl-positive product that was shown by HPLC not to be
L-ascorbic acid. The present inventors believe that it was the 6-deoxy analog of
L-ascorbic acid.

Example 2 

This example shows that in Prototheca, like other plants (Loewus, F.A. 1988. In:
J. Preiss (ed.), The Biochemistry of Plants, 14:85-107. New York, Academic Press) and
the green microalga Chlorella pyrenoidosa (Renstrom, et al., 1983. Plant Sci. Lett.
28:299-305), ascorbic acid (AA) production from glucose proceeds by a biosynthetic
pathway that allows retention of the configuration of the carbon skeleton of glucose.

Cultures of the strain UV77-247 were grown to moderate cell density in shake
flasks with 1-¹³C-labeled glucose as 10% of the total glucose (40 g/L). Incubation was
as per the standard Mg-limited screen (see Example 3). The culture supernates were
clarified, deionized to remove salts, lyophilized, and subjected to nuclear magnetic
resonance (NMR) analysis to determine where in the AA molecule the ¹³C was located. In
each case, approximately 85% of the label was found at the C-1 position of AA, with most
of the remaining label at the C-6 position. This strongly indicated that AA is synthesized
from glucose by a pathway that retains the carbon chain configuration, i.e., C-1 of
glucose becomes C-1 of AA. This has typically been observed in plants (Loewus, F.A.
1988. Ascorbic acid and its metabolic products. In: The Biochemistry of Plants, ed. J.
Preiss, 14:85-107. New York, Academic Press). Animals (Mapson, L.W. and F.A.
Isherwood 1956. Biochem. J. 64:151-157; Loewus, F.A. 1960. J. Biol. Chem.
235(4):937-939) and protists such as Euglena (Shigeoka, S., et al., 1979. J. Nutr. Sci.
Vitaminol. 25:299-307), on the other hand, synthesize AA by a pathway that involves the
inversion of configuration, i.e., C-1 of glucose becomes C-6 of AA. Demonstration of
the inversion/non-inversion nature of the pathway was an important step in determining
the pathway of AA biosynthesis since the two types of pathways require different types
of enzymatic reactions. The label found at C-6 of AA is thought to be due to metabolism
of glucose and subsequent gluconeogenesis. The metabolism of glucose in glycolysis
proceeds through triose-phosphate intermediates. After this, the C-1 and C-6 carbons of
glucose become biochemically equivalent. Hexose phosphates can be regenerated from
the triose phosphates by gluconeogenesis, which is essentially a reversal of the degradative
pathway. Consequently, metabolism of C-1-labeled glucose to triose phosphates with
subsequent gluconeogenesis would result in the formation of hexose phosphate molecules
labeled at either or both C-1 and C-6. If those hexose phosphates were precursors to AA,
one would expect the AA to be similarly labeled. Consistent with this type of "isotopic
mixing" is the observation that sucrose obtained from 1-¹³C-labeled glucose was labeled
at positions 1, 6, 1' and 6'.

Glucose can also be metabolized by the pentose phosphate pathway, the overall
balanced equation for which is:

3 Glucose-6-phosphate → 2 Fructose-6-phosphate + Glyceraldehyde-3-phosphate + 3 CO2

Based on the known biochemistry, it would then be expected that the label at each
of the carbons in glucose (Table 1, left column) would appear at the positions for the other
molecules shown, and that these patterns would be reflected in the AA formed from C-2-
and C-3-labeled glucose.

TABLE 1

Predicted Carbon Labeling of Metabolites of Glucose in the Pentose Phosphate Pathway

Labeled Glucose    Position of Labeled Carbon in:
Carbon             CO2      F6P(1)    F6P(2)    G3P
1                  +        -         -         -
2                  -        1,3       1         -
3                  -        2         2,3       -
4                  -        4         4         1
5                  -        5         5         2
6                  -        6         6         3
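By way of illustration only, the predictions of Table 1 can be written as a lookup table. The following Python sketch merely transcribes Table 1; the names `PPP_FATE` and `predicted_f6p_positions` are hypothetical and are not part of the original disclosure.

```python
# Fate of each glucose carbon after one pass through the pentose phosphate
# pathway, transcribed from Table 1. All names here are illustrative.
PPP_FATE = {
    1: {"CO2": True,  "F6P(1)": [],     "F6P(2)": [],     "G3P": []},
    2: {"CO2": False, "F6P(1)": [1, 3], "F6P(2)": [1],    "G3P": []},
    3: {"CO2": False, "F6P(1)": [2],    "F6P(2)": [2, 3], "G3P": []},
    4: {"CO2": False, "F6P(1)": [4],    "F6P(2)": [4],    "G3P": [1]},
    5: {"CO2": False, "F6P(1)": [5],    "F6P(2)": [5],    "G3P": [2]},
    6: {"CO2": False, "F6P(1)": [6],    "F6P(2)": [6],    "G3P": [3]},
}


def predicted_f6p_positions(glucose_carbon):
    """Return {F6P position: labeled occurrences} summed over both F6P molecules."""
    counts = {}
    for molecule in ("F6P(1)", "F6P(2)"):
        for position in PPP_FATE[glucose_carbon][molecule]:
            counts[position] = counts.get(position, 0) + 1
    return counts


if __name__ == "__main__":
    for carbon in (2, 3):
        print(carbon, predicted_f6p_positions(carbon))
```

For glucose labeled at C-2, the sketch counts position 1 of fructose-6-phosphate twice and position 3 once, which is the two-to-one pattern expected to carry over into AA if hexose phosphates are its precursors.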



AA recovered from cultures fed glucose labeled at C-2 or C-3 was also analyzed
for its labeling patterns (Table 2).



TABLE 2

Labeling Pattern in AA after Cells were Fed 2-¹³C- and 3-¹³C-glucose

Carbon            Isotopic enhancement after growth on:
Position in AA    C-2 labeled glucose    C-3 labeled glucose
1                 1.0                    0.4
2                 10.0                   0.9
3                 0.5                    9.9
4                 0                      2.8
5                 2.2                    0.2
6                 0                      0



The data above again suggest a pathway from glucose to AA that proceeds by
retention of configuration. As in the experiments with C-1 labeled glucose, approximately
one-fifth of the label is present in the "mirror image" position to the glucose label (C-5 for C-2
labeled glucose and C-4 for C-3 labeled glucose), indicating levels of gluconeogenesis
consistent with those previously observed.

The small but significant amount of enhancement observed in other positions is
consistent with flux through the pentose phosphate pathway. As predicted above, carbon
flux through this pathway would result in isotopic enhancement at positions 1 and 3 when
cells were grown on 2-¹³C glucose and enhancement at position 2 when cells were grown
on 3-¹³C glucose. This is indeed observed. That there is twice as much enhancement at
C-1 as there is at C-3 after growth on 2-¹³C glucose is also predicted. These data indicate
a small but measurable amount of carbon flux through the pentose phosphate pathway.
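The ratios discussed above can be recomputed directly from the Table 2 values. The following Python sketch is offered for illustration only; the dictionary `ENHANCEMENT` and the function `mirror_ratio` are hypothetical names, and the "mirror image" carbon of position n is taken to be position 7 - n.

```python
# Isotopic enhancement values transcribed from Table 2 (AA carbon -> value).
ENHANCEMENT = {
    "C-2": {1: 1.0, 2: 10.0, 3: 0.5, 4: 0.0, 5: 2.2, 6: 0.0},
    "C-3": {1: 0.4, 2: 0.9, 3: 9.9, 4: 2.8, 5: 0.2, 6: 0.0},
}


def mirror_ratio(feed, labeled_carbon):
    """Enhancement at the mirror-image carbon (7 - n) relative to carbon n."""
    data = ENHANCEMENT[feed]
    return data[7 - labeled_carbon] / data[labeled_carbon]


# Gluconeogenic "isotopic mixing": C-5 vs C-2, and C-4 vs C-3.
c2_mirror = mirror_ratio("C-2", 2)
c3_mirror = mirror_ratio("C-3", 3)

# Pentose phosphate signature after growth on 2-13C glucose: C-1 vs C-3.
ppp_c1_to_c3 = ENHANCEMENT["C-2"][1] / ENHANCEMENT["C-2"][3]

if __name__ == "__main__":
    print(round(c2_mirror, 2), round(c3_mirror, 2), ppp_c1_to_c3)
```

Under these assumptions the mirror-image ratios come out at roughly 0.2 to 0.3, and the C-1 to C-3 ratio after growth on 2-¹³C glucose is 2, in line with the description above.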

Example 3 

This example shows the methods for generating, screening and isolating mutants 
of Prototheca with altered AA productivities compared to the starting strain ATCC 
75669. 

ATCC No. 75669, identified as Prototheca moriformis RSP1385 (unicellular
green microalga), was deposited on February 8, 1994, with the American Type Culture
Collection (ATCC), Rockville, Maryland, 20852, USA, under the terms of the Budapest
Treaty on the International Recognition of the Deposit of Microorganisms for the Purpose
of Patent Procedure. Initial screening of Prototheca species and strains was reported in
U.S. Patent No. 5,900,370, ibid. Table 3 lists the formulations of the media for growth
and maintenance of the strains. Glucose for fermentors was supplied as glucose 
monohydrate and calculated on an anhydrous basis. The recipe for the trace metals 
solution is given in Table 4. The standard growth temperature was 35°C. All organisms 
were cultured axenically. 



TABLE 3

Media for Growth and Maintenance of Prototheca Strains
All quantities are in g/L unless otherwise specified

                                                    Liquid                    Agar
Ingredient                                          Standard     Mg-limiting  Slants       Ferrozine    Standard
                                                                                           Plates       Plates
Potassium phosphate monobasic                       1.3          1.3          2.0          0.27         2.0
Potassium phosphate dibasic                         3.8          3.8          2.0          1.4          2.0
Trisodium citrate dihydrate                         7.7          7.7          2.6          1.3          2.6
Magnesium sulfate heptahydrate                      0.40         0.02         0.4          0.01         0.4
Ammonium sulfate                                    3.7          3.7          1.0          1.0          1.0
Trace Metals Solution                               2 mL         2 mL         2 mL         2 mL         2 mL
Ferrous sulfate heptahydrate                        1.5 mg       4.5 mg       1.5 mg       -            1.5 mg
Calcium chloride dihydrate                          -            0.25         -            -            -
Manganous sulfate monohydrate                       -            0.08         -            -            -
Yeast extract                                       -            -            2.5          -            -
Agar                                                -            -            15           15 (Noble)   15
pH before autoclaving                               7.2          7.2          7.2          7.2          7.2

Autoclave, then add:
Copper sulfate pentahydrate, 100 g/L                -            -            -            2 mL         -
40 g/L Ferrozine in 5 mM phosphate (pH 7.5 final)   -            -            -            8.8 mL       -
Ferric ammonium sulfate dodecahydrate, 40 g/L       -            -            -            3.8 mL       -
50% glucose with 25 mg/L thiamine HCl               40 mL        60 mL        10 mL        10 mL        10 mL



TABLE 4

Trace Metals Solution

                               Molecular    Conc. of Individ.    mL Indiv. Stock per
Compound                       Weight       Solutions, g/L       liter of Working Stock
Distilled Water                -            -                    823
Hydrochloric Acid              -            Conc.                20
Cobalt chloride hexahydrate    237.9        24.0                 6.5
Boric acid                     61.8         38.1                 24
Zinc sulfate heptahydrate      287.5        35.3                 50
Manganous sulfate monohydrate  169.0        24.6                 50
Sodium molybdate dihydrate     242.0        23.8                 2.0
Calcium chloride dihydrate     147.0        -                    11.4 g
Vanadyl sulfate dihydrate      199.0        10.0                 8.0
Nickel nitrate hexahydrate     290.8        5.0                  8.0
Sodium selenite                173.0        5.0                  8.0



Mutant isolates were generated by treatment with one or more of the following
agents: nitrous acid (NA); ethyl methane sulfonate (EMS); or ultraviolet light (UV).
Typically, glucose-depleted cells grown in standard liquid medium were washed and
resuspended in 25 mM phosphate buffer, pH 7.2, diluted to approximately 10⁷
colony-forming units per mL (cfu/mL), exposed to the mutagen to achieve about 99% kill,
incubated 4-8 hours in the dark, and spread onto standard agar medium, or agar media
containing differential agents.

Some mutant colonies on standard agar medium were picked randomly and
subcultured to master plates. Other isolation plates were inverted over chloroform to lyse
cells on the surface of the colonies and allow them to release AA. Released AA was
detected by spraying the treated plates with a solution of 2,6-dichlorophenol-indophenol
(1.25 g/L in 70% EtOH). The ability of AA to reduce this blue redox dye to its colorless
form is the basis for a standard assay of AA (Omaye, et al., 1979. Meth. Enzymol.
62:3-11). Colonies derived from mutagenized cells were saved to master plates for
further evaluation if their clear halos were significantly larger than the halos typical of the
other mutants in that group. Other mutagenized cells were spread onto plates containing
an AA detection system incorporated directly into the agar. This system is based on the
ability of AA to reduce ferric iron to ferrous iron. The compound ferrozine
(3-(2-pyridyl)-5,6-bis(4-phenylsulfonic acid)-1,2,4-triazine) was present in the agar to
complex with the ferrous iron and give a violet color reaction. The ferrozine agar
formulation is shown in Table 3. Colonies giving the darkest color reactions were
master-plated. When screening for non-AA-producing strains (blocked mutants), white
colonies were chosen against a background of relatively dark colonies.

For primary screening of tube cultures, cells were inoculated from master plates 
into 4 mL of Mg-limiting medium in 16 x 125 mm test tubes, and tubes were shaken in a
slanted position on a rotary shaker at 300 rpm for four days. After both three and four
days of incubation, aliquots were removed for AA assay and cell density determination.
Those for AA assay were centrifuged at 1500 x g for 5 min and the resulting supernates 
were removed for either colorimetric assay or high pressure liquid chromatography 
(HPLC). Promising isolates were retested in tube culture. Those passing the tube screen 
were tested in shake flasks. 

For secondary screening of flask cultures, cells were inoculated into 50 mL of 
standard flask medium in 250 mL baffled shake flasks, and incubated on a rotary shaker 
at 180 rpm until glucose depletion (24-48 hours). A second series of flasks of 
Mg-sufficient standard medium was inoculated from the first set to a cell density of 0.15
A620, and incubated for 24 hours. A third series of Mg-limiting flask medium was
inoculated from the second set by a 1/50 dilution and incubated for 96 hours. Flasks were
sampled for AA analysis and cell density measurements during this time as required.
Aliquots for supernatant AA analysis were centrifuged at 5000 x g for 5 min. Aliquots
for total whole broth AA analysis were first extracted for 15 min with an equal volume of 
5% trichloroacetic acid (TCA) before centrifugation. Aliquots of the resulting supernates 
were removed for either colorimetric assay or HPLC analysis. 

For colorimetric assay of AA, a modification of the method of Omaye, et al.
(1979. Meth. Enzymol. 62:3-11) was used. Twenty-five µL aliquots of culture supernates
were added to wells of 96-well microplates, and 125 µL of color reagent was added. The
color reagent consisted of four parts 0.5% aqueous 2,2'-dipyridyl and one part 8.3 mM
ferric ammonium sulfate in 27% (v/v) o-phosphoric acid, the two components being
mixed immediately before use. After one hour, the absorbance at 520 nm was read. AA
concentration was calculated by comparison of the absorbances of AA standards.
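As a purely illustrative sketch of the comparison to AA standards, a straight-line standard curve can be fitted and inverted. The description does not state how the comparison was performed; the linear fit, the function names, and every numerical value below are assumptions introduced here and are not data from the present experiments.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def aa_from_absorbance(a520, slope, intercept):
    """Invert the standard curve to estimate AA (mg/L) from an A520 reading."""
    return (a520 - intercept) / slope


# Hypothetical AA standards (mg/L) and A520 readings; NOT from the source.
standards_mg_per_l = [0.0, 50.0, 100.0, 200.0]
standards_a520 = [0.05, 0.30, 0.55, 1.05]

slope, intercept = fit_line(standards_mg_per_l, standards_a520)
sample_mg_per_l = aa_from_absorbance(0.80, slope, intercept)

if __name__ == "__main__":
    print(round(sample_mg_per_l, 1))
```

A linear response is assumed only over the range of the standards; readings outside that range would call for dilution of the sample rather than extrapolation.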

HPLC analysis was based on that of Running, et al, (1994). Supernates were 
chromatographed on a Bio-Rad HPX-87H organic acid column (Bio-Rad Laboratories, 
Richmond, CA) with 13 mM nitric acid as solvent, at a flow rate of 0.7 mL/min at room 

temperature. Detection was at either 254 nm using a Waters 441 detector (Millipore
Corp., Milford, MA), or at 245 nm using a Waters 481 detector. This system can
distinguish between the L- and D-isomers of AA.

For dry weight determinations of cell density, 5 mL whole broth samples were 
centrifuged at 5000 x g for 5 min, washed once with distilled water, and the pellet was
washed into a tared aluminum weighing pan. Cells were dried for 8-24 h at 105 °C. Cell
weight was calculated by difference.

Table 5 shows the abilities of various mutants of Prototheca to synthesize AA.



TABLE 5

AA Synthesizing Ability of Various Prototheca Mutants in Flask Screen

              Specific AA Formation, mg AA per L/Culture A620,
              during Mg-limited incubation
Strain        2 Days Incubation    4 Days Incubation
ATCC 75669    22                   35
EMS13-4       79                   166
UV213-1       0                    0
UV218-1       0                    0
UV244-1       0                    0
UV244-15      58                   68
UV77-247      56                   83
UV140-1       67                   100
UV164-6       81                   131
NA21-14       27                   78
UV82-21       0                    0
UV127-10      50                   85
SP2-3         3                    4
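For illustration only, the Table 5 values can be compared against the starting strain ATCC 75669. The following Python sketch uses hypothetical names (`SPECIFIC_AA`, `fold_over_parent`, `blocked`) and simply divides each mutant's specific AA formation by that of the parent; treating a strain with zero at both time points as "blocked" is an assumption made here for the example.

```python
# Specific AA formation (2 days, 4 days), transcribed from Table 5.
SPECIFIC_AA = {
    "ATCC 75669": (22, 35),
    "EMS13-4": (79, 166),
    "UV213-1": (0, 0),
    "UV218-1": (0, 0),
    "UV244-1": (0, 0),
    "UV244-15": (58, 68),
    "UV77-247": (56, 83),
    "UV140-1": (67, 100),
    "UV164-6": (81, 131),
    "NA21-14": (27, 78),
    "UV82-21": (0, 0),
    "UV127-10": (50, 85),
    "SP2-3": (3, 4),
}
PARENT = "ATCC 75669"


def fold_over_parent(strain, day_index=1):
    """Specific AA formation relative to the parent (index 0 = 2 days, 1 = 4 days)."""
    return SPECIFIC_AA[strain][day_index] / SPECIFIC_AA[PARENT][day_index]


# Strains showing no AA at either time point (candidate blocked mutants).
blocked = sorted(s for s, values in SPECIFIC_AA.items() if values == (0, 0))

if __name__ == "__main__":
    print(round(fold_over_parent("EMS13-4"), 2))
    print(blocked)
```

On these figures, EMS13-4 shows roughly a 4.7-fold higher specific AA formation than the parent after four days of Mg-limited incubation.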



The genealogy of these isolates is presented graphically in the "family tree" of Fig.
3. ATCC No. ______, identified as Prototheca moriformis EMS13-4 (unicellular green
microalga), was deposited on May 25, 1999, with the American Type Culture Collection
(ATCC), 10801 University Boulevard, Manassas, VA 20110, USA, under the terms of
the Budapest Treaty on the International Recognition of the Deposit of Microorganisms
for the Purpose of Patent Procedure. ATCC No. ______, identified as Prototheca
moriformis UV127-10 (unicellular green microalga), was deposited on May 25, 1999,
with the American Type Culture Collection (ATCC), 10801 University Boulevard,
Manassas, VA 20110, USA, under the terms of the Budapest Treaty on the International
Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure.
ATCC No. ______, identified as Prototheca moriformis SP2-3 (unicellular green
microalga), was deposited on May 25, 1999, with the American Type Culture Collection
(ATCC), 10801 University Boulevard, Manassas, VA 20110, USA, under the terms of
the Budapest Treaty on the International Recognition of the Deposit of Microorganisms
for the Purpose of Patent Procedure.

Example 4

The following example shows that both growing and resting cells of Prototheca
can rapidly convert L-galactose and L-galactono-γ-lactone to AA, and that conversion
of D-mannose to AA by Prototheca is more rapid than conversion of D-glucose.

Shake flask cultures of the mutant strain UV77-247 were grown to glucose
depletion in standard liquid medium (Table 3). Cells were washed twice and resuspended
in complete medium with the glucose substituted by one of the compounds listed below.
Cell suspensions were incubated for 24 hours at 35 °C with shaking, and the entire
suspension was extracted with TCA as above and assayed for AA.

Tables 6-8 show that both growing and resting cells of strain UV77-247 can
rapidly convert L-galactose and L-galactono-γ-lactone to AA. In these experiments,
D-fructose and D-galactose were converted to AA at the same rate as D-glucose,
suggesting that they are metabolized to AA through the same route as D-glucose. None
of the organic acids suggested in the literature to be intermediates in the biosynthesis of
AA were converted to AA, including sorbosone, which has been proposed as an
intermediate by Saito et al. (1990, Plant Physiol. 94:1496-1500).



TABLE 6

Conversion of Compounds by Resting Cells of Strain UV77-247

                                                 AA Relative to No
Substrate (50 mM)             Total AA, mg/L     Substrate Control
L-galactose                   965                623
L-galactono-γ-lactone         818                476
D-fructose                    590                248
D-glucosone                   589                247
D-glucose                     584                242
D-galactose                   542                200
D-glucose (10 mM)             388                46
D-gluconolactone              382                40
D-gulono-γ-lactone            366                24
D-glucuronate                 364                22
L-sorbosone                   342                0
None                          342                0
2-keto-D-gluconic acid        341                -1
D-isoascorbic acid (10 mM)    330                -12
D-glucuronolactone            329                -13
D-gluconic acid               309                -33
D-galacturonic acid           297                -45
L-idonate                     296                -46



Since strain UV77-247 converted L-galactose and L-galactono-γ-lactone to AA
much more rapidly than it did glucose, this suggests that these compounds are intermediates
in the AA biosynthetic pathway and that they are "downstream" from glucose.
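The "AA Relative to No Substrate Control" column of Table 6 is consistent with subtracting the no-substrate value (342 mg/L) from each total. The following Python sketch, given for illustration only, recomputes a few of these net values; the names `TOTAL_AA` and `net_aa` are hypothetical, and only a subset of the substrates is transcribed.

```python
# Total AA (mg/L) for selected substrates, transcribed from Table 6.
TOTAL_AA = {
    "None": 342,
    "L-galactose": 965,
    "L-galactono-gamma-lactone": 818,
    "D-fructose": 590,
    "D-glucose": 584,
    "D-galactose": 542,
    "L-sorbosone": 342,
    "D-gluconic acid": 309,
}


def net_aa(substrate):
    """AA formed above the no-substrate control, in mg/L."""
    return TOTAL_AA[substrate] - TOTAL_AA["None"]


# Net AA from L-galactose relative to net AA from D-glucose.
galactose_vs_glucose = net_aa("L-galactose") / net_aa("D-glucose")

if __name__ == "__main__":
    for name in TOTAL_AA:
        print(name, net_aa(name))
    print(round(galactose_vs_glucose, 2))
```

On this basis the net AA formed from L-galactose is about 2.6 times that formed from D-glucose over the same incubation, while L-sorbosone gives no net increase over the control.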

The data in Tables 7 and 8 also show that growing and resting cells of UV77-247 
consistently convert D-mannose to AA at a rate greater than that of glucose. 
TABLE 7

Conversion of Compounds to AA by Resting Cells of Strain UV77-247

                         Ascorbic Acid, mg/L
Compound                 25.5 h    30 h    47 h
L-galactose              667       718     620
L-galactono-γ-lactone    644       681     749
D-glucosone              465       462     354
D-mannose                448       462     399
D-fructose               402       408     367
D-glucose                395       404     351
D-galactose              352       361     337
none                     287       288     258



TABLE 8

Conversion of Compounds to AA by Growing Cells of Strain UV77-247

Compound                    Ascorbic Acid, mg/L   A       AA/A
                            25.5 h    44 h

L-galactose                 249       506       4.5     112
D-mannose                   228       488       5.6     87
L-galactono-γ-lactone       214       342       5.0     68
D-glucose                   178       398       5.9     67
D-fructose                  181       383       5.9     65
D-glucosone                 176       362       5.7     64
D-galactose                 185       380       5.9     64
none                        182       249       4.4     57
D-gluconic acid (K)         178       262       5.0     52
L-idonate (Na)              182       232       4.7     49
2-keto-D-gluconic acid      182       255       5.3     46
2-deoxy-D-glucose           181       227       4.8     47
D-glucuronic acid lactone   165       218       5.0     44
D-glucuronic acid (Na)      173       241       5.6     43
L-gulono-γ-lactone          152       195       5.0     39
L-sorbosone                 178       160       4.7     34
D-glucono-δ-lactone         130       190       5.7     33
D-galacturonic acid         130       180       6.0     30



These cells converted L-galactose, L-galactono-γ-lactone and D-mannose to AA
more rapidly than they did glucose, suggesting that mannose acts at a point in the
biosynthetic pathway "downstream" from glucose.
Example 5

Using the methods described above, a collection of mutants was assembled. The
specific AA formation of representative mutants is shown in Table 5. The genealogy
of these isolates is presented graphically in the "family tree" of Fig. 3.

These isolates were tested for their ability to convert compounds which could be
converted to AA by strain UV77-247. Testing was done as in Example 4. Results are
shown in Table 9.

TABLE 9

Conversion of Compounds to AA by Resting Cells
of Mutant Strains of Prototheca of Varying Abilities to Synthesize AA

Strain      Absolute AA, mg/L
            Buffer    Glucose   L-galactose   L-gal-γ-lact  Mannose   Fructose

EMS13-4     53        87        191           173           139       ND
UV127-10    45        140       213           140           128       143
SP2-3       19        19        204           146           24        27
NA21-14     61        80        147           158           118       115
UV82-21     15        16        183           175           18        17
UV213-1     16        15        170           135           17        16
UV218-1     16        18        136           176           19        21
UV244-1     16        16        164           162           16        16
UV244-15    26        77        30            21            94        89
UV244-16    28        64        53            53            53        66

ND = Not Determined



These data suggest that the mutational blocks in those strains which convert
fructose and mannose to AA poorly are before ("upstream" from) L-galactose and
L-galactono-γ-lactone in the pathway.
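The reasoning applied to Table 9 can be sketched in Python as follows. The sketch flags strains that convert L-galactose well but convert mannose hardly at all, which is the pattern expected for a block upstream of L-galactose. The thresholds (a gain of less than 10 mg/L from mannose and more than 100 mg/L from L-galactose, both relative to buffer) are assumptions chosen for illustration and are not taken from the disclosure, and the EMS13-4 buffer value of 53 is a reading of a partly illegible table entry.

```python
# Absolute AA (mg/L) from Table 9: (buffer, L-galactose, mannose).
table9 = {
    "EMS13-4": (53, 191, 139),   # buffer value read from a partly illegible entry
    "UV127-10": (45, 213, 128),
    "SP2-3": (19, 204, 24),
    "NA21-14": (61, 147, 118),
    "UV82-21": (15, 183, 18),
    "UV213-1": (16, 170, 17),
    "UV218-1": (16, 136, 19),
    "UV244-1": (16, 164, 16),
    "UV244-15": (26, 30, 94),
    "UV244-16": (28, 53, 53),
}

# Illustrative thresholds (assumptions, not from the source text).
MAX_MANNOSE_GAIN = 10    # mg/L over buffer counted as "poor" conversion
MIN_GALACTOSE_GAIN = 100  # mg/L over buffer counted as "good" conversion


def upstream_block_candidates(data):
    """Return strains that use L-galactose well but mannose poorly."""
    hits = []
    for strain, (buffer, l_gal, mannose) in data.items():
        poor_mannose = (mannose - buffer) < MAX_MANNOSE_GAIN
        good_galactose = (l_gal - buffer) > MIN_GALACTOSE_GAIN
        if poor_mannose and good_galactose:
            hits.append(strain)
    return hits


candidates = upstream_block_candidates(table9)
print(candidates)
```

Under these assumed thresholds the flagged strains are the ones whose mannose and fructose values in Table 9 are essentially equal to the buffer value.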

Example 6 

The following example shows that magnesium inhibits early steps in the production
of AA.

To address the question of whether magnesium actually inhibits AA synthesis,
strain NA45-3 (ATCC 209681) was grown in magnesium (Mg)-limited and Mg-sufficient
medium. ATCC No. 209681, identified as Prototheca moriformis NA45-3 (Source:
repeated mutagenesis of ATCC No. 75669; Eucaryotic alga, Division Chlorophyta, Class
Chlorophyceae, Order Chlorococcales), was deposited on March 13, 1998, with the
American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, VA
20110, USA, under the terms of the Budapest Treaty on the International Recognition of
the Deposit of Microorganisms for the Purpose of Patent Procedure. Cells from both
cultures were harvested and resuspended in the cell-free supernate from the Mg-limited
culture, and to half of each cell suspension additional magnesium was added in order to
bring the level in the suspension to the Mg-sufficient level. The four conditions under
which assays were run were as follows.

TABLE 10

Conditions Used to Test the Effect of Magnesium on AA Production

Condition     Magnesium concentration, g/L, during:
              Growth    Assay

1Mg>1Mg       0.02      0.02
1Mg>10Mg      0.02      0.2
10Mg>1Mg      0.2       0.02
10Mg>10Mg     0.2       0.2



Substrates previously shown to lead to the formation of AA, namely D-glucose,
D-glucosone, D-fructose, D-galactose, D-mannose, and L-galactono-γ-lactone, were
added at 20 g/L to the four cell suspensions. Accumulation of AA after 24 hours was
measured and compared to a control in which no substrate was added. The results of this
study are shown graphically in Fig. 4.

When cells grown under magnesium-limited conditions were incubated with
substrates in low-magnesium broth (1Mg>1Mg condition), all substrates gave significant
and similar accumulation of AA over the control condition. When the same cells were
incubated in high-magnesium broth (1Mg>10Mg condition), the accumulation of AA was
reduced about 40% for all substrates except D-mannose and L-galactono-γ-lactone,
suggesting that 1) the rate-limiting step in the conversion of D-glucose, D-glucosone,
D-fructose, and D-galactose to AA is inhibited by magnesium or 2) magnesium stimulates
an enzyme which results in the conversion of these compounds to some other
compound(s), reducing the amount of substrate available for AA synthesis. On the other
hand, conversion of D-mannose and L-galactono-γ-lactone appeared to be unaffected by
the presence of magnesium in the resuspension buffer, indicating that either 1)
magnesium-inhibited enzymes are not involved in the conversion of these substrates to AA
or 2) D-mannose and L-galactono-γ-lactone enter the pathway downstream from the
point where substrates can be siphoned off by side reactions involving Mg-requiring
enzymes.

When cells were grown under magnesium-sufficient conditions, very little AA
accumulation from any of the D-sugars was observed, regardless of the level of
magnesium in the resuspension broth. Accumulation of AA from L-galactono-γ-lactone,
however, was enhanced over that observed when cells were grown in Mg-limited
conditions. This suggests that enzymes early in the pathway are repressed under
Mg-sufficient conditions. Thus, the D-substrates all behaved similarly, with the exception
of the apparent lack of magnesium inhibition of D-mannose conversion to AA. This
would suggest that D-mannose enters the AA biosynthetic pathway at a point different
from that of the other D-sugars.

Figs. 2A and 2B represent some of the fates of glucose in plants. The first
enzymatic step in this scheme which commits carbon to glycolysis is the conversion of
fructose-6-P to fructose-1,6-diP by phosphofructokinase (PFK). This reaction is
essentially irreversible, and leads to the well known TCA cycle and oxidative
phosphorylation, with concomitant ATP and NADH/NADPH generation. PFK has an
absolute requirement for magnesium. If magnesium is limiting, this reaction could slow
and eventually stop, blocking the flow of carbon through glycolysis and beyond, and
would result in cessation of cell division even in the presence of excess glucose. One
would expect fructose-6-P to accumulate under these conditions, fueling AA synthesis by
the pathway shown in Figs. 1 and 2.



Example 7 

The following example shows the correlation in Prototheca between AA
production and the activity levels of the enzymes in the AA pathway.

Phosphomannose Isomerase (PMI) Assay

PMI activity was first assayed (See Fig. 1). Ten strains representing a range of
AA productivities were grown according to the standard protocol to measure
AA-synthesizing ability. Cells were harvested 96 hours into magnesium-limited
incubation, washed and resuspended in buffer containing 50 mM Tris/10 mM MgCl2, pH
7.5. The suspended cells were broken in a French press, spun at 30,000 x g for 30
minutes, and desalted through Sephadex G-25 (Pharmacia PD-10 columns). Reactions
were carried out in the reverse direction by adding various volumes of extracts to
solutions of Tris/Mg buffer containing 0.15 U phosphoglucose isomerase (EC 5.3.1.9),
0.5 U glucose-6-phosphate dehydrogenase (EC 1.1.1.49), and 1.0 mM NADP. Reactions
were initiated by addition of 3 mM (final) mannose-6-phosphate. Final reaction volume
was 1.0 mL. All components were dissolved in Tris/Mg buffer. Activities were taken as
the change in A340/min. From these activities were subtracted the activities measured in
identical reaction mixtures lacking the M-6-P substrate. Specific activities were calculated
by normalizing the activities for protein concentration in the reactions. Protein in the
original extracts was determined by the method of Bradford, using a kit from Bio-Rad
Laboratories (Hercules, CA). All enzymes and nucleotides were purchased from Sigma
Chemical Co. (St. Louis, MO).
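As an illustrative sketch of the calculation described above, a specific activity in nmoles NADP reduced per min/mg protein can be derived from an absorbance change as shown below. The sketch assumes that absorbance is read at 340 nm in a 1 cm cuvette and uses the commonly cited millimolar extinction coefficient of NADPH at that wavelength (6.22 per mM per cm); the function name and the numerical inputs are hypothetical and are not data from this example.

```python
# Millimolar extinction coefficient of NADPH at 340 nm (mM^-1 cm^-1).
NADPH_EXT_MM = 6.22


def specific_activity(delta_a_sample, delta_a_blank, protein_mg,
                      volume_ml=1.0, path_cm=1.0):
    """Return nmol NADP(H) converted per min per mg protein.

    delta_a_sample: absorbance change per minute with substrate present
    delta_a_blank:  absorbance change per minute without substrate
    protein_mg:     mg of extract protein in the reaction
    """
    # Subtract the no-substrate rate, as described for the assay.
    net_rate = delta_a_sample - delta_a_blank
    # Beer-Lambert law: concentration change in mM per minute.
    mm_per_min = net_rate / (NADPH_EXT_MM * path_cm)
    # mM equals umol/mL, so multiply by the volume to obtain umol/min.
    umol_per_min = mm_per_min * volume_ml
    # Convert to nmol/min and normalize for protein.
    return umol_per_min * 1000.0 / protein_mg


# Hypothetical readings: 0.0150 A/min with substrate, 0.0026 A/min without,
# 0.2 mg protein in a 1.0 mL reaction.
example = specific_activity(0.0150, 0.0026, protein_mg=0.2)
print(round(example, 2))
```

The same function applies to the NADH-linked PFK assay, since NADH has the same extinction coefficient at 340 nm; only the sign of the absorbance change differs.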

Phosphomannomutase (PMM) Assay

Phosphomannomutase was measured in a similar manner in the same strains, but
these assay reaction mixtures also contained 0.25 mM glucose-1,6-diphosphate and 0.5 U
commercially available PMI, and the reactions were started with the addition of 3.0 mM
(final) mannose-1-phosphate rather than mannose-6-phosphate.

Phosphofructokinase (PFK) Assay

To shed light on the possibility that the enhancement of AA concentration in
cultures which were limited for magnesium was due to a diversion of carbon from normal
metabolism by a reduced activity of the first committed step in glycolysis (PFK), the strains
were also assayed to confirm the presence of this enzyme activity. Cells were cultured,
washed and broken as above. Extracts were centrifuged at 100,000 x g for 90 min before
desalting. Reactions were carried out in the forward direction by adding various volumes
of extracts to solutions of Tris/Mg buffer containing 1.5 mM dithiothreitol, 0.86 U
aldolase (EC 4.1.2.13), 1.4 U α-glycerophosphate dehydrogenase (EC 1.1.1.8), 14 U
triosephosphate isomerase (EC 5.3.1.1), 0.11 mM NADH, and 1.0 mM ATP. Reactions
were initiated by addition of 5 mM (final) fructose-6-phosphate. Final reaction volume
was 1.0 mL. All components were dissolved in Tris/Mg buffer. Activities were taken as
the change in A340/min. From these activities were subtracted the activities measured in
identical reaction mixtures lacking the F-6-P substrate. Specific activities were calculated
by normalizing the activities for protein concentration in the reaction. Protein in the
original extracts was determined as above.

GDP-D-mannose Pyrophosphorylase (GMP) Assay

These same mutant strains were assayed for the next enzyme in the proposed
pathway, GMP. Strains were grown both according to the standard Mg-limiting protocol
(harvested 43-48 hours into magnesium-limited incubation) and in standard Mg-sufficient
medium (harvesting all cells before glucose depletion). Washed cell pellets were
resuspended in 50 mM phosphate buffer, pH 7.0, containing 20% (v/v) glycerol and 0.1
M sodium chloride (3 mL buffer/g wet cells), and broken in a French press. Crude extracts
were spun at 15,000 x g for 15 minutes. Reactions were carried out in the forward
direction by adding various volumes of extracts to solutions of 50 mM phosphate/4 mM
MgCl2 buffer, pH 7.0, containing 1 mM GTP. Reactions were initiated by addition of 1
mM (final) mannose-1-phosphate. Final reaction volume was 0.1 mL. Reaction mixtures
were incubated at 30 °C for 10 min, filtered through a 0.45 µm PVDF syringe filter, and
analyzed for GDP-mannose by HPLC. A Supelcosil SAX1 column (4.6 x 250 mm) was
used with a solvent gradient (1 mL/min) of: A - 6 mM potassium phosphate, pH 3.6; B
- 500 mM potassium phosphate, pH 4.5. The gradient was: 0-3 min, 100% A; 3-10 min,
79% A; 10-15 min, 29% A. Column temperature was 30 °C. Two assays that showed
enzyme activity proportional to the amount of protein were averaged. Control
no-substrate and no-extract reactions were also run. Specific activity was calculated by
normalizing the activity for protein concentration in the reaction. Protein in the original
extracts was determined as above.
GDP-D-mannose:GDP-L-galactose Epimerase Assay

Further tests measured the activities of the next enzyme in the proposed pathway,
GDP-D-mannose:GDP-L-galactose epimerase. Strains were grown according to the
standard protocol, harvested 43-48 hours into magnesium-limited incubation, washed, and
resuspended in buffer containing 50 mM MOPS/5 mM EDTA, pH 7.2. Washed pellets
were broken in a French press, and spun at 20,000 x g for 20 min. Protein determinations
were made as above and a dilution series of each was made, ranging from 0.4 to 2.2 mg
protein/mL. 50 µL aliquots of these dilutions were added to 10 µL aliquots of 6.3 mM
GDP-D-mannose in which a portion of this substrate was universally labeled with 14C in
the mannose moiety. This substrate had an activity of 16 nCi/mL before dilution into the
reaction mixture. Reactions were stopped after 10 min by transferring 20 µL of the
mixture into microfuge tubes containing 20 µL of 250 mM trifluoroacetic acid (TFA)
containing 1.0 g/L each D-mannose and L-galactose. These tubes were sealed and boiled
for 10 min, cooled, spun for 60 sec in a Beckman Microfuge E, and 5 µL of each
hydrolysate was spotted on 20 x 20 cm plastic-backed EM Science Silica gel 60 thin-layer
chromatography plates (#5748/7), with 1 cm lanes created by scoring with a blunt stylus.
After drying, plates were twice chromatographed for 2.5 hours in ethyl
acetate:isopropanol:water, 65:22.3:12.7 (plates were dried between runs). Spots of free
sugars were visualized by spraying dried plates with 0.5% p-anisaldehyde in a 62%
ethanolic solution of 0.89 M sulfuric acid and 0.17 mM glacial acetic acid, and heating at
105 °C for about 15 min. Spots of L-galactose and D-mannose were cut from the plates
and counted in a scintillation counter (Beckman model 2800). For time-zero control
counts, 16.7 µL of each extract dilution was added to 23.3 µL of the labeled substrate
above, which had been diluted 1:7 with the TFA/mannose/galactose solution.
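The volumes used for the time-zero control follow from the reaction volumes, as the following Python sketch illustrates. It assumes that the substrate aliquot was 10 µL (the unit is not legible in the source text); the variable names are illustrative only.

```python
from fractions import Fraction

# Reaction mixture: extract dilution plus labeled substrate (microliters).
extract_ul = 50
substrate_ul = 10   # assumed to be microliters; the unit is missing in the source

# A 20 uL sample of the reaction is stopped in 20 uL of TFA/sugar solution.
sampled_ul = 20
tfa_ul = 20

reaction_ul = extract_ul + substrate_ul

# Volumes of extract and substrate contained in each 20 uL sample.
extract_in_sample = Fraction(sampled_ul * extract_ul, reaction_ul)
substrate_in_sample = Fraction(sampled_ul * substrate_ul, reaction_ul)

# For the time-zero control the substrate is mixed with the TFA solution
# first, so that the extract meets the substrate only under stop conditions.
prediluted_substrate = substrate_in_sample + tfa_ul
dilution_factor = prediluted_substrate / substrate_in_sample

print(round(float(extract_in_sample), 1))      # extract volume, uL
print(round(float(prediluted_substrate), 1))   # pre-diluted substrate, uL
print(dilution_factor)                         # total volume per part substrate
```

With a 10 µL substrate aliquot the sketch returns the 16.7 µL and 23.3 µL volumes and the 1:7 dilution stated for the time-zero control, so the control has the same final composition as a stopped reaction.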

Table 11 summarizes the results of the five enzyme assays for the strains tested,
along with their specific AA formations.
TABLE 11

Specific Enzyme Activities (mU)* of Selected Mutant Prototheca Strains

Strain      AA Specific   PMI    PMM    PFK    GMP,         GMP,            Epimerase
            Form., mg/g                        Mg-limited   Mg-sufficient

UV164-6     78.4                                                            0.79
EMS13-4     73.7          10.8   69.6   13.5   Z.O          o.o             0.78
UV140-1     69.9                                                            0.78
NA45-3      61.4                                                            0.58
UV77-247    44.4                                                            0.52
UV127-10    40.1          11.1   45.8   24.4   4.3          5.9             0.39
UV244-15    24.5          14.3   41.5          3.1          5.3             0.42
NA21-14     23.6          12.1   60.3   47.4   2.4          7.6             0.27
ATCC 75669  21.9                                                            0.28
UV244-16    5.0           16.5   85.6          4.3          52
SP2-3       2.0           17.7   47.0   64.5   2.0          7.5             0.03
UV218-1     0.4           15.9   72.1          2.7          7.0             0.83
UV213-1     0.1           19.7   47.7   32.6   3.2          6.7             0.60
UV82-21     0.0           14.6   70.6   30.4   4.1          7.5             0.15
UV244-1     0.0           18.2   51.1          5.5          12              0.15

*Units: PMI and PMM, nmoles NADP reduced per min/mg protein; PFK, nmoles NADH oxidized per
min/mg protein; GMP, nmoles GDP-D-mannose formed per min/mg protein; epimerase, nmoles
GDP-L-galactose formed per min/mg protein.

The only enzyme which showed a strong correlation between activity and the
ability to synthesize AA was the GDP-D-mannose:GDP-L-galactose epimerase. This
correlation is depicted in Fig. 5. All of the strains which produced measurable amounts
of AA had measurable amounts of epimerase activity. The converse was not true: four
of the strains which synthesize little or no AA had significant epimerase activities. These
strains are candidates for having mutations which affect enzymatic steps downstream from
the epimerase. Since all of the strains tested can synthesize AA from L-galactose and
L-galactono-γ-lactone (see Examples 4 and 5), the genetic lesion(s) in these four mutants
must lie between GDP-L-galactose and free L-galactose.
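The relationship described above can be examined numerically. The following Python sketch computes a Pearson correlation coefficient between specific AA formation and epimerase activity, using the values reported in Table 11 for the strains having both measurements. Two entries (73.7 for EMS13-4 and 0.28 for ATCC 75669) are readings of partly illegible table entries, and the cut-off of 1.0 mg/g used to define an AA-producing strain is an assumption made for illustration; the statistical treatment is not part of the original disclosure.

```python
import math

# (AA specific formation in mg/g, epimerase activity) from Table 11.
table11 = {
    "UV164-6": (78.4, 0.79),
    "EMS13-4": (73.7, 0.78),    # AA value read from a partly illegible entry
    "UV140-1": (69.9, 0.78),
    "NA45-3": (61.4, 0.58),
    "UV77-247": (44.4, 0.52),
    "UV127-10": (40.1, 0.39),
    "UV244-15": (24.5, 0.42),
    "NA21-14": (23.6, 0.27),
    "ATCC 75669": (21.9, 0.28),  # epimerase value read from a partly illegible entry
    "SP2-3": (2.0, 0.03),
    "UV218-1": (0.4, 0.83),
    "UV213-1": (0.1, 0.60),
    "UV82-21": (0.0, 0.15),
    "UV244-1": (0.0, 0.15),
}


def pearson_r(pairs):
    """Pearson correlation coefficient for a list of (x, y) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    syy = sum((y - mean_y) ** 2 for _, y in pairs)
    return sxy / math.sqrt(sxx * syy)


# Assumed cut-off separating AA producers from non-producers.
PRODUCER_CUTOFF = 1.0

all_pairs = list(table11.values())
producer_pairs = [p for p in all_pairs if p[0] >= PRODUCER_CUTOFF]

r_all = pearson_r(all_pairs)
r_producers = pearson_r(producer_pairs)
print(f"all strains: r = {r_all:.2f}")
print(f"AA producers only: r = {r_producers:.2f}")
```

The coefficient is markedly higher when only the AA-producing strains are considered, which is consistent with the observation that some strains retain epimerase activity while synthesizing little or no AA.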
Example 8 

The next example shows the relationship between GDP-D-mannose:GDP-L-galactose
epimerase activity and the degree of magnesium limitation in two strains, the
original unmutagenized parent strain ATCC 75669, and one of the best AA producers,
EMS13-4 (ATCC ).

Four flasks of each strain were grown according to the standard protocol. One
culture of each was harvested 24 hours into magnesium-limited incubation, and every 24
hours thereafter for a total of four days. One flask of each strain was also harvested 24
hours into magnesium-sufficient incubation. All cultures had glucose remaining when
harvested. Fig. 6 shows graphically the AA productivity and epimerase activity in
EMS13-4 and ATCC 75669 as the cultures became Mg-limited. Epimerase activity in
EMS13-4 was significantly greater than that in ATCC 75669 at all time points. There was
also a concurrent rapid rise in both AA productivity and epimerase activity in EMS13-4
as the cultures became increasingly Mg-limited. While there was a moderate increase in
AA productivity in ATCC 75669 as Mg became more limiting, there was no effect on
epimerase activity.
Example 9 

The following example shows the results of epimerase assays performed with
extracts of two E. coli strains into which was cloned the E. coli gene for
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase.

The E. coli K12 wca gene cluster is responsible for colanic acid production;
wcaG encodes a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase.

The E. coli wcaG sequence (nucleotides 4 through 966 of SEQ ID NO:3) was
amplified by PCR from E. coli W3110 genomic DNA using primers WG EcoRI 5 (5'
TAGAATTCAGTAAACAACGAGTTTTTATTGCTGG 3'; SEQ ID NO:12) and WG
XhoI 3 (5' AACTCGAGTTACCCCCAAAGCGGTCTTGATTC 3'; SEQ ID NO:13).
The 973-bp PCR product was ligated into the vector pPCR-Script SK(+) (Stratagene,
La Jolla, CA). The 973-bp EcoRI/XhoI fragment was moved from this plasmid into the
EcoRI/XhoI sites of pGEX-5X-1 (Amersham Pharmacia Biotech, Piscataway, NJ),
creating plasmid pSW67-1. Plasmid pGEX-5X-1 is a GST gene fusion vector which adds
a 26-kDa GST moiety onto the N-terminal end of the protein of interest. E. coli
BL21(DE3) was transformed with pSW67-1 and pGEX-5X-1, resulting in strains
BL21(DE3)/pSW67-1 and BL21(DE3)/pGEX-5X-1.

The E. coli wcaG sequence (nucleotides 1 through 966 of SEQ ID NO:3) was also
amplified by PCR from E. coli W3110 genomic DNA using primers WG EcoRI 5-2 (5'
CTGGAGTCGAATTCATGAGTAAACAACGAG 3'; SEQ ID NO:14) and WG PstI 3
(5' AACTGCAGTTACCCCCGAAAGCGGTCTTGATTC 3'; SEQ ID NO:15). The 976-bp
PCR product was ligated into pPCR-Script (Stratagene). The 976-bp EcoRI/PstI
fragment was moved from this plasmid into the EcoRI/PstI sites of expression vector
pKK223-3 (Amersham Pharmacia Biotech), creating plasmid pSW75-2. E. coli JM105
was transformed with pKK223-3 and pSW75-2, resulting in strains JM105/pKK223-3 and
JM105/pSW75-2.
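Each primer carries, near its 5' end, the recognition site of the restriction enzyme for which it is named. As an illustrative check, the following Python sketch locates the standard recognition sequences of EcoRI (GAATTC), XhoI (CTCGAG) and PstI (CTGCAG) in the four primer sequences given above; the function and dictionary names are hypothetical.

```python
# Standard recognition sequences of the restriction enzymes used for cloning.
SITES = {
    "EcoRI": "GAATTC",
    "XhoI": "CTCGAG",
    "PstI": "CTGCAG",
}

# Primer sequences (5' to 3') as given for SEQ ID NOs 12 through 15.
PRIMERS = {
    "WG EcoRI 5": "TAGAATTCAGTAAACAACGAGTTTTTATTGCTGG",
    "WG XhoI 3": "AACTCGAGTTACCCCCAAAGCGGTCTTGATTC",
    "WG EcoRI 5-2": "CTGGAGTCGAATTCATGAGTAAACAACGAG",
    "WG PstI 3": "AACTGCAGTTACCCCCGAAAGCGGTCTTGATTC",
}


def find_sites(sequence):
    """Map each enzyme whose site occurs in the sequence to its 0-based offset."""
    found = {}
    for enzyme, site in SITES.items():
        position = sequence.find(site)
        if position != -1:
            found[enzyme] = position
    return found


site_map = {name: find_sites(seq) for name, seq in PRIMERS.items()}

for name, found in site_map.items():
    print(name, found)
```

Each primer contains exactly one of the three sites, located within the first few nucleotides after a short 5' extension, which is the arrangement expected for directional cloning of the PCR product.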

All six strains were grown in duplicate at 37 °C with shaking in 2X YTA medium
until an optical density of 0.8-1.0 at 600 nm was reached (about three hours). 2X YTA
contains 16 g/L tryptone, 10 g/L yeast extract, 5 g/L sodium chloride and 100 mg/L
ampicillin. One of each culture was induced by adding isopropyl β-D-thiogalactopyranoside
(IPTG) to 1 mM final concentration. All 12 cultures were
incubated for an additional four hours, washed in 0.9% NaCl, and the cells were frozen
at -80 °C. Prior to pelleting the cells for preparation of extracts, a portion of each culture
was used for a plasmid DNA miniprep to confirm the presence of the appropriate plasmids
in these strains. A protein preparation of each culture was also run on SDS gels to
confirm expression of a protein of the appropriate size where expected. Frozen pellets
were thawed, resuspended in 2.5 mL MOPS/EDTA buffer, pH 7.2, broken in a French
Press (10,000 psi), spun for 20 min at 20,000 x g, assayed for protein as above and diluted
to 0.01, 0.1, 1.0 and 3 mg/mL protein.

Induction of the strain BL21(DE3)/pGEX-5X-1 resulted in high-level expression
of a 26-kDa protein, indicating the synthesis of the native GST protein. Induction of strain
BL21(DE3)/pSW67-1 resulted in high-level expression of a 62-kDa protein, indicating the
synthesis of the native GST protein (26K) fused to the wcaG gene product (36K). An
aliquot of the fusion protein was treated with the protease Factor Xa (New England
Biolabs, Beverly, MA), which cleaves near the GST/wcaG junction. Induction of the
strain JM105/pSW75-2 resulted in high-level expression of a 36-kDa protein, indicating
the synthesis of the wcaG gene product. No such protein was detected in
JM105/pKK223-3 (vector only).

Next, it was of interest to test extracts in the standard epimerase assay described
in Example 7 to determine if any of the extracts containing the wcaG product could bring
about the conversion of GDP-D-mannose to GDP-L-galactose. The extracts to be
assayed are:

BL21(DE3) Group

1. BL21(DE3) uninduced

2. BL21(DE3) induced with 1 mM IPTG

3. BL21(DE3)/pGEX-5X-1 uninduced

4. BL21(DE3)/pGEX-5X-1 induced with 1 mM IPTG

5. BL21(DE3)/pSW67-1 uninduced

6. BL21(DE3)/pSW67-1 induced with 1 mM IPTG; fusion protein intact

7. BL21(DE3)/pSW67-1 induced with 1 mM IPTG; GST moiety cleaved

JM105 Group

1. JM105 uninduced

2. JM105 induced with 1 mM IPTG

3. JM105/pKK223-3 uninduced

4. JM105/pKK223-3 induced with 1 mM IPTG

5. JM105/pSW75-2 uninduced

6. JM105/pSW75-2 induced with 1 mM IPTG

Extracts 1 and 7 from the BL21(DE3) group and extracts 1 and 6 from the JM105
group were tested for GDP-D-mannose:GDP-L-galactose epimerase-like activity in a pilot
experiment. In this initial experiment, no epimerase activity was detected in any of the
extracts. At this time, such a result can be attributed to a number of possibilities. First,
it is possible that the wcaG gene product is incapable of catalyzing the conversion of
GDP-D-mannose to GDP-L-galactose, although this conclusion cannot be reached until
several other parameters are tested. Second, it is possible that under the assay conditions
which are satisfactory to measure activity for the endogenous
GDP-D-mannose:GDP-L-galactose epimerase, the wcaG gene product does not have
GDP-D-mannose:GDP-L-galactose epimerase-like activity. Therefore, alternate conditions
should be tested. Additionally, confirmation experiments should be performed to confirm
the accuracy of the pilot conditions. Third, although the BL21(DE3) and the JM105 clones
produce proteins of the expected size, the constructs have not been sequenced to confirm the
proper coding sequence for the wcaG gene product and thereby rule out PCR or cloning
errors which may render the wcaG gene product inactive. Fourth, the protein formed
from the cloned sequence may be full-length but inactive, for example, due to incorrect
tertiary structure (folding). Fifth, the gene may be overexpressed, resulting in accumulation
of insoluble and inactive protein products (inclusion bodies). Future experiments will
attempt to determine whether the constructs have or can be induced to have the ability to
catalyze the conversion of GDP-D-mannose to GDP-L-galactose, and to use the sequences to
isolate the endogenous GDP-D-mannose:GDP-L-galactose epimerase.

Table 12 provides the atomic coordinates for Brookhaven Protein Data Bank
Accession Code 1bws:

TABLE 12 

HEADER    EPIMERASE/REDUCTASE                     27-SEP-99   1BWS
TITLE     CRYSTAL STRUCTURE OF GDP-4-KETO-6-DEOXY-D-MANNOSE
TITLE    2 EPIMERASE/REDUCTASE FROM ESCHERICHIA COLI A KEY ENZYME IN
TITLE    3 THE BIOSYNTHESIS OF GDP-L-FUCOSE
COMPND    MOL_ID: 1;
COMPND   2 MOLECULE: GDP-4-KETO-6-DEOXY-D-MANNOSE EPIMERASE/REDUCTASE;
COMPND   3 CHAIN: A;
COMPND   4 ENGINEERED: YES;
COMPND   5 BIOLOGICAL_UNIT: HOMODIMER
SOURCE    MOL_ID: 1;
SOURCE   2 ORGANISM_SCIENTIFIC: ESCHERICHIA COLI;
SOURCE   3 EXPRESSION_SYSTEM: ESCHERICHIA COLI
KEYWDS    EPIMERASE/REDUCTASE, GDP-L-FUCOSE BIOSYNTHESIS
EXPDTA    X-RAY DIFFRACTION
AUTHOR    RIZZI,TONETTI,VIGEVANI,STURLA,BISSO,DE FLORA
REVDAT   1   13-JAN-99 1BWS    0
JRNL        AUTH   RIZZI,TONETTI,VIGEVANI,STURLA,BISSO,DE FLORA
JRNL        TITL   GDP-4-KETO-6-DEOXY-D-MANNOSE EPIMERASE/REDUCTASE
JRNL        TITL 2 FROM ESCHERICHIA COLI, A KEY ENZYME IN THE
JRNL        TITL 3 BIOSYNTHESIS OF GDP-L-FUCOSE, DISPLAYS THE
JRNL        TITL 4 STRUCTURAL CHARACTERISTICS OF THE RED PROTEIN
JRNL        TITL 5 HOMOLOGY SUPERFAMILY
JRNL        REF    STRUCTURE (LONDON)
JRNL        REFN
REMARK   1
REMARK   2
REMARK   2 RESOLUTION. 2.2 ANGSTROMS.
REMARK   3
REMARK   3 REFINEMENT.
REMARK   3   PROGRAM     : TNT
REMARK   3   AUTHORS     : TRONRUD,TEN EYCK,MATTHEWS
REMARK   3
REMARK   3  DATA USED IN REFINEMENT.
REMARK   3   RESOLUTION RANGE HIGH (ANGSTROMS) : 2.2
REMARK   3   RESOLUTION RANGE LOW  (ANGSTROMS) : 15.0
REMARK   3   DATA CUTOFF            (SIGMA(F)) : 0.0
REMARK   3   COMPLETENESS FOR RANGE        (%) : 99.1
REMARK   3   NUMBER OF REFLECTIONS             : 24481
REMARK   3
REMARK   3  USING DATA ABOVE SIGMA CUTOFF.
REMARK   3   CROSS-VALIDATION METHOD          : NONE
REMARK   3   FREE R VALUE TEST SET SELECTION  : NULL
REMARK   3   R VALUE     (WORKING + TEST SET) : NULL
REMARK   3   R VALUE            (WORKING SET) :
REMARK   3   FREE R VALUE                     : NULL
REMARK   3   FREE R VALUE TEST SET SIZE   (%) : NONE
REMARK   3   FREE R VALUE TEST SET COUNT      : NULL
REMARK   3
REMARK   3  USING ALL DATA, NO SIGMA CUTOFF.
REMARK   3   R VALUE   (WORKING + TEST SET, NO CUTOFF) : NULL
REMARK   3   R VALUE          (WORKING SET, NO CUTOFF) : 0.202
REMARK   3   FREE R VALUE                  (NO CUTOFF) :
REMARK   3   FREE R VALUE TEST SET SIZE (%, NO CUTOFF) : NULL
REMARK   3   FREE R VALUE TEST SET COUNT   (NO CUTOFF) : NULL
REMARK   3   TOTAL NUMBER OF REFLECTIONS   (NO CUTOFF) :
REMARK   3
REMARK   3   NUCLEIC ACID ATOMS       : NULL
REMARK   3   OTHER ATOMS              :
REMARK   3
REMARK   3  WILSON B VALUE (FROM FCALC, A**2) : NULL
REMARK   3
REMARK   3  RMS DEVIATIONS FROM IDEAL VALUES.    RMS
REMARK   3   BOND LENGTHS                 (A) : 0.016 ; NULL ; NULL
REMARK   3   BOND ANGLES            (DEGREES) : 1.65  ; NULL ; NULL
REMARK   3   TORSION ANGLES         (DEGREES) : NULL  ; NULL ; NULL
REMARK   3   PSEUDOROTATION ANGLES  (DEGREES) : NULL  ; NULL ; NULL
REMARK   3   TRIGONAL CARBON PLANES       (A) : NULL  ; NULL
REMARK   3   GENERAL PLANES               (A) :
REMARK   3   ISOTROPIC THERMAL FACTORS (A**2) : NULL  ; NULL ; NULL
REMARK   3   NON-BONDED CONTACTS              :
REMARK   3
REMARK   3  INCORRECT CHIRAL-CENTERS (COUNT) : NULL
REMARK   3
REMARK   3  BULK SOLVENT MODELING.
REMARK   3   METHOD USED : NULL
REMARK   3   KSOL        : NULL
REMARK   3   BSOL        : NULL
REMARK   3
REMARK   3  RESTRAINT LIBRARIES.
REMARK   3   STEREOCHEMISTRY : NULL
REMARK   3   ISOTROPIC THERMAL FACTOR RESTRAINTS : NULL
REMARK   3
REMARK   3  OTHER REFINEMENT REMARKS: NULL
REMARK   4
REMARK   4 1BWS COMPLIES WITH FORMAT V. 2.2, 16-DEC-1996
REMARK   5
REMARK   5 WARNING
REMARK   5 1BWS: THIS IS LAYER 1 RELEASE.
REMARK   5
REMARK   5 PLEASE NOTE THAT THIS ENTRY WAS RELEASED AFTER DEPOSITOR
REMARK   5 CHECKING AND APPROVAL BUT WITHOUT PDB STAFF INTERVENTION.
REMARK   5 AN AUXILIARY FILE, AUX1BWS.RPT, IS AVAILABLE FROM THE
REMARK   5 PDB FTP SERVER AND IS ACCESSIBLE THROUGH THE 3DB BROWSER.
REMARK   5 THE FILE CONTAINS THE OUTPUT OF THE PROGRAM WHAT_CHECK AND
REMARK   5 OTHER DIAGNOSTICS.
REMARK   5
REMARK   5 NOMENCLATURE IN THIS ENTRY, INCLUDING HET RESIDUE NAMES
REMARK   5 AND HET ATOM NAMES, HAS NOT BEEN STANDARDIZED BY THE PDB
REMARK   5 PROCESSING STAFF. A LAYER 2 ENTRY WILL BE RELEASED SHORTLY
REMARK   5 AFTER THIS STANDARDIZATION IS COMPLETED AND APPROVED BY THE
REMARK   5 DEPOSITOR. THE LAYER 2 ENTRY WILL BE TREATED AS A
REMARK   5 CORRECTION TO THIS ONE, WITH THE APPROPRIATE REVDAT RECORD.
REMARK   5
REMARK   5 FURTHER INFORMATION INCLUDING VALIDATION CRITERIA USED IN
REMARK   5 CHECKING THIS ENTRY AND A LIST OF MANDATORY DATA
REMARK   5 ARE AVAILABLE FROM THE PDB WEB SITE AT
REMARK   5 http://WWW.PDB.BNL.GOV/.
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REMARK 200 EXPERIMENTAL DETAILS 



REMARK 200  EXPERIMENT TYPE                : X-RAY DIFFRACTION
REMARK 200  DATE OF DATA COLLECTION        : AUG-1997
REMARK 200  TEMPERATURE           (KELVIN) : 120
REMARK 200  PH                             :
REMARK 200  NUMBER OF CRYSTALS USED        :
REMARK 200
REMARK 200  SYNCHROTRON              (Y/N) : N
REMARK 200  RADIATION SOURCE               : NONE
REMARK 200  BEAMLINE                       : NULL
REMARK 200  X-RAY GENERATOR MODEL          : RIGAKU RU200
REMARK 200  MONOCHROMATIC OR LAUE    (M/L) : M
REMARK 200  WAVELENGTH OR RANGE        (A) : 1.5410
REMARK 200  MONOCHROMATOR                  : NULL
REMARK 200  OPTICS                         :
REMARK 200
REMARK 200  DETECTOR TYPE                  : IMAGE PLATE
REMARK 200  DETECTOR MANUFACTURER          : RAXIS
REMARK 200  INTENSITY-INTEGRATION SOFTWARE : MOSFLM
REMARK 200  DATA SCALING SOFTWARE          : SCALA
REMARK 200
REMARK 200  NUMBER OF UNIQUE REFLECTIONS   : 24481
REMARK 200  RESOLUTION RANGE HIGH      (A) : 2.2
REMARK 200  RESOLUTION RANGE LOW       (A) : 15.0
REMARK 200  REJECTION CRITERIA  (SIGMA(I)) : NONE
REMARK 200
REMARK 200 OVERALL.
REMARK 200  COMPLETENESS FOR RANGE     (%) : 99.7
REMARK 200  DATA REDUNDANCY                : 4.3
REMARK 200  R MERGE                    (I) : 0.057
REMARK 200  R SYM                      (I) : NONE
REMARK 200  <I/SIGMA(I)> FOR THE DATA SET  :
REMARK 200
REMARK 200 IN THE HIGHEST RESOLUTION SHELL.
REMARK 200  HIGHEST RESOLUTION SHELL, RANGE HIGH (A) : NULL
REMARK 200  HIGHEST RESOLUTION SHELL, RANGE LOW  (A) : NULL
REMARK 200  COMPLETENESS FOR SHELL     (%) : NULL
REMARK 200  DATA REDUNDANCY IN SHELL       : NULL
REMARK 200  R MERGE FOR SHELL          (I) : NULL
REMARK 200  R SYM FOR SHELL            (I) : NULL
REMARK 200  <I/SIGMA(I)> FOR SHELL         : NULL
REMARK 200
REMARK 200 DIFFRACTION PROTOCOL: NULL
REMARK 200 METHOD USED TO DETERMINE THE STRUCTURE: MIR
REMARK 200 SOFTWARE USED: NULL
REMARK 200 STARTING MODEL: NULL
REMARK 200
REMARK 200 REMARK: NULL
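As an illustration of how the data-collection values above relate to one another, the following minimal sketch uses Bragg's law (first order) to convert the listed wavelength (1.5410 A) and resolution limits (2.2 A and 15.0 A) into scattering angles. The function name and variable names are hypothetical and are not part of the listing.

```python
import math


def bragg_two_theta_deg(wavelength_a: float, d_spacing_a: float) -> float:
    """Return the scattering angle 2-theta in degrees from Bragg's law with n = 1."""
    sin_theta = wavelength_a / (2.0 * d_spacing_a)
    if not 0.0 < sin_theta <= 1.0:
        raise ValueError("d-spacing is not reachable at this wavelength")
    return 2.0 * math.degrees(math.asin(sin_theta))


# Values taken from the REMARK 200 records of the listing.
two_theta_high = bragg_two_theta_deg(1.5410, 2.2)   # high-resolution limit
two_theta_low = bragg_two_theta_deg(1.5410, 15.0)   # low-resolution limit
print(f"2-theta spans {two_theta_low:.1f} to {two_theta_high:.1f} degrees")
```

The sketch only restates the geometry implied by the listed numbers; it does not reproduce any processing performed with MOSFLM or SCALA.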
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REMARK 280 CRYSTAL 






REMARK 280 SOLVENT CONTENT, VS   (%): NULL
REMARK 280 MATTHEWS COEFFICIENT, VM (ANGSTROMS**3/DA): NULL
REMARK 280
REMARK 280 CRYSTALLIZATION CONDITIONS: NULL
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REMARK 290 


REMARK 290 CRYSTALLOGRAPHIC SYMMETRY
REMARK 290 SYMMETRY OPERATORS FOR SPACE GROUP: P 32 2 1
REMARK 290
REMARK 290      SYMOP   SYMMETRY
REMARK 290       1555   X,Y,Z
REMARK 290       2555   -Y,X-Y,Z+2/3
REMARK 290       3555   Y-X,-X,Z+1/3
REMARK 290       4555   Y,X,-Z
REMARK 290       5555   X-Y,-Y,1/3-Z
REMARK 290       6555   -X,Y-X,2/3-Z
REMARK 290
REMARK 290     WHERE NNN -> OPERATOR NUMBER
REMARK 290           MMM -> TRANSLATION VECTOR
REMARK 290
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REMARK 290 CRYSTALLOGRAPHIC SYMMETRY TRANSFORMATIONS 



REMARK 290 THE FOLLOWING TRANSFORMATIONS OPERATE ON THE ATOM/HETATM 
REMARK 290 RECORDS IN THIS ENTRY TO PRODUCE CRYSTALLOGRAPHICALLY 



REMARK 290 
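The six symmetry operators listed for space group P 32 2 1 can be written as a rotation part plus a translation part. The sketch below is illustrative only and assumes fractional (unit-cell) coordinates; the ATOM/HETATM records in the listing are in orthogonal Angstrom coordinates, so a conversion using the unit cell (not shown in this section) would be needed first. The names `OPS` and `apply_op` are hypothetical.

```python
from fractions import Fraction

# Operators of P 32 2 1 as (rotation rows, translation), acting on fractional (x, y, z).
OPS = {
    "1555": (((1, 0, 0), (0, 1, 0), (0, 0, 1)), (0, 0, 0)),                    # X,Y,Z
    "2555": (((0, -1, 0), (1, -1, 0), (0, 0, 1)), (0, 0, Fraction(2, 3))),     # -Y,X-Y,Z+2/3
    "3555": (((-1, 1, 0), (-1, 0, 0), (0, 0, 1)), (0, 0, Fraction(1, 3))),     # Y-X,-X,Z+1/3
    "4555": (((0, 1, 0), (1, 0, 0), (0, 0, -1)), (0, 0, 0)),                   # Y,X,-Z
    "5555": (((1, -1, 0), (0, -1, 0), (0, 0, -1)), (0, 0, Fraction(1, 3))),    # X-Y,-Y,1/3-Z
    "6555": (((-1, 0, 0), (-1, 1, 0), (0, 0, -1)), (0, 0, Fraction(2, 3))),    # -X,Y-X,2/3-Z
}


def apply_op(op, frac):
    """Apply one (rotation, translation) operator to a fractional coordinate triple."""
    rotation, translation = op
    return tuple(
        sum(r * c for r, c in zip(row, frac)) + t
        for row, t in zip(rotation, translation)
    )


point = (Fraction(1, 10), Fraction(1, 5), Fraction(3, 10))
for code, op in OPS.items():
    print(code, apply_op(op, point))
```

Applying operator 2555 three times returns the starting x and y with z shifted by two whole cell lengths, which is the expected behaviour of a threefold screw axis.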
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REMARK 290 
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REMARK i£5 THE FPLLQffINQ RESIPVES WERE NOT LQCATEP IN THE 
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REMARK 465 
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REMARK 46S 



REMARK itt 
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REMARK BOO 
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REMARK BOO CATAT.vtt^ PFlffTrTTTi 
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REMARK BOO 




REMARK BOO CATAT.VTTr RRffTriTTI 
REMARK BOO 




15 REMARK BOO STTF. nE<^RTPTT?m 



REMARK «nn 

DBBJBF 1RWS ft 



-2 116 SHS P3205S P CL ECOT.T 



SEflBJES. 
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1 A 321 MBT SER T.YS TtT,W ftRg VAT, PHK TT.F at.a ct.v ^ gjg 

2 A 321 MET VAT. CT, Y 



20 
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HETATM 55 n HOH 67 42 746 2S.1S3 93.465 1.00 27 .05 Q 

HETATM 56 O HOH 68 53.638 -16.457 32.292 1.00 31.71 O
HETATM 57 O HOH 69 33.390 41.716 31.408 1.00 29.92 O
HETATM 58 O HOH 70 57.768 17.897 42.434 1.00 25.75 O
HETATM 59 O HOH 71 75.647 9.164 ??.766 1.00 35.1? O
HETATM 60 O HOH 72 62.032 33.292 44.749 1.00 46.18 O
HETATM 61 O HOH 73 47.310 14.312 34.285 1.00 31.18 O
HETATM 62 O HOH 74 79.660 -3.947 15.913 1.00 34.63 O
HETATM 63 O HOH 75 46.929 5.343 4.550 1.00 23.?4 O
HETATM 64 O HOH 76 73.475 12.039 28.412 1.00 27.26 O
HETATM 65 O HOH 77 46.297 -6.982 30.032 1.00 43.41 O
HETATM 66 O HOH 78 68.528 -3.422 40.869 1.00 38.47 O
HETATM 67 O HOH 79 62.080 -1.448 42.803 1.00 24.60 O
HETATM 68 O HOH 80 65.330 18.150 40.726 1.00 41.00 O
HETATM 69 O HOH 81 51.775 16.128 37.607 1.00 25.11 O
HETATM 70 O HOH 83 54.266 28.682 43.313 1.00 27.61 O
HETATM 71 O HOH 85 73.291 -15.47? 20.603 1.00 37.54 O
HETATM 72 O HOH 86 34.760 21.479 28.544 1.00 43.87 O
HETATM 73 O HOH 87 37.326 24.131 29.677 1.00 24.47 O
HETATM 74 O HOH 88 65.168 20.148 6.735 1.00 26.10 O
HETATM 75 O HOH 89 59.196 12.089 13.630 1.00 25.24 O
HETATM 76 O HOH 91 66.576 -6.235 40.279 1.00 43.11 O
HETATM 77 O HOH 9? 37.339 29.394 25.51? 1.00 27.56 O
HETATM 78 O HOH 94 52.339 -17.014 42.271 1.00 48.96 O
HETATM 79 O HOH 95 40.511 32.927 31.717 1.00 22.46 O
HETATM 80 O HOH 96 78.580 13.121 34.138 1.00 27.98 O
HETATM 81 O HOH 97 65.090 15.704 34.876 1.00 ?0.9? O
HETATM 82 O HOH 99 84.562 2.951 27.181 1.00 35.92 O
HETATM 83 O HOH 100 50.38? 9.761 9.646 1.00 23.18 O
HETATM 84 O HOH 101 67.649 -0.851 38.764 1.00 24.99 O
HETATM 85 O HOH 102 44.001 4.293 34.315 1.00 31.?? O
HETATM 86 O HOH 103 59.386 -5.071 26.211 1.00 29.10 O
HETATM 87 O HOH 104 77.364 4.74? 41.506 1.00 35.32 O
HETATM 88 O HOH 105 59.034 21.201 32.414 1.00 23.43 O
HETATM 89 O HOH 106 42.463 34.698 14.327 1.00 38.86 O
HETATM 90 O HOH 107 70.217 14.292 20.864 1.00 42.39 O
HETATM 91 O HOH 108 76.999 8.130 2?.86? 1.00 32.91 O
HETATM 92 O HOH 109 49.766 ?9.9?7 ??.17? 1.00 42.52 O
HETATM 93 O HOH 110 72.473 13.536 38.823 1.00 33.32 O
HETATM 94 O HOH 111 64.328 -12.084 38.608 1.00 37.99 O
HETATM 95 O HOH 112 60.161 16.382 4?.68? 1.00 35.68 O
HETATM 96 O HOH 113 47.602 13.639 27.016 1.00 26.01 O
HETATM 97 O HOH 115 64.606 11.644 40.107 1.00 30.33 O
HETATM 98 O HOH 116 61.231 -15.137 27.255 1.00 38.76 O
HETATM 99 O HOH 117 65.324 -11.223 35.098 1.00 30.45 O
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HETATM lpp p HQH no 56.60? n.219 44.93? yym 2 

HETftTM 1P1 0 HQH 12fi 37.564 19.B60 23.13S 1.00 31.27 fl 

HETftTM 102 P HQH 123 64.B45 s.0S7 21.139 i.nn 4V5i q 

HETATM 1Q3 Q HQH 123 63.391 16.801 26.89B 1 .00 3B.46 Q 

5 HETATM 1P4 P HQH 124 42.567 6.134 32.635 1.00 31.56 £ 

HETATM IP? Q H2H 125. 72.4B5 13.236 35.059 1.00 29.61 2 

HETATM IP 6 Q HQS 12fi 65.229 3.650 44.032 l.OO 36.B6 Q 

HETATM 1P7 P HQH 122 37.089 7.14B 31.0B3 1.00 39. 5B £ 

HETftTM IPS p HQH 12S 73.327 10.546 12.123 1.00 34.97 fl 

10 HETftTM IP? P HQH 122 74.450 10.299 26.59B 1.00 30. BO fl 

HETftTM UP AP?* WPP ft 1 67.524 13.055 26.692 1.00 36.4? Q 

HETftTM 111 ftC5* WPP ft 3 68.089 12.297 2S.614 1.00 9.30 £ 

HETftTM 112 AC4* WPP ft 1 69.601 12.124 25.B5B 1.00 27.73 £ 

HETftTM 113 ftPJ* WPP ft 1 70.193 11.258 24.B4B 1.00 22. B7 Q 

I 5 HETftTM 114 AC3* HPP ft 1 70.484 13.390 25.873 1.00 17B3 £ 

HETftTM 11? fl, Q 3« WPP ft 1 71.192 13.436 27.066 1.00 16.11 fl 

HETftTM 116 ftC2* WPP ft 1 71.373 13.220 24.626 1.00 11.46 £ 

HETftTM 117 AP2* WPP ft 1 72.623 13.886 24.655 1.00 31.96 Q 

HETftTM 118 HZ}* WPP ft 1 71.510 11.702 24.656 1.00 19.02 £ 

20 HETftTM 11? P3 WPP ft 1 65.336 13.590 26.129 1.00 70,59 q 

HETftTM 12P MP?' WPP ft 1 63.536 11.943 26.448 1.00 28.99 Q 

HETftTM 121 NC?« WPP ft 1 64.328 10.843 2S.957 1.00 24.89 £ 

HETATM 122 NC4* WPP A 1 63.467 9.646 25.686 1.00 31.79 £ 

HETftTM 123 NP4* WPP ft 1 62.837 9. 337 26.908 1.00 2B.B? £ 

25 HETftTM 124 HC3* WPP ft 1 62.340 9.837 24.665 1.00 11.50 £ 

HETftTM 12? WP3* WPP ft 1 62.891 9.402 23.461 1.00 2B.60 Q 

HETftTM 126 WC2* WPP ft 1 61.15? 8.996 25.1 3B i.oo ?an £ 

HETftTM 127 NP2« WPP ft 1 60.881 7. 662 24.715 1.00 24.30 £ 

HETftTM 128 NCI* WPP ft 1 61,?47 8.875 26.sao i.oo as. as £ 

30 HETftTM 12? ftP 2* WPP A 1 73.104 15.069 23.823 1 .00 39.96 £ 

HETftTM 130 APfl WPP ft 1 74.500 15. 308 24.308 1.00 37. B4 £ 

HETftTM 131 ftPP2 WPP ft 1 72.797 14. 925 22.348 1.00 36.66 £ 

HETftTM 132 ftPP3 WPP ft 1 72.163 16.917 23.958 1.00 31.97 £ 

HETftTM 133 ftP WPP ft 1 66.660 14.257 26.393 1.00 26.17 XX 

35 HETftTM 134 ftPl WPP ft 1 66.886 14.795 25.047 1.00 15.31 XX 

HETftTM 13? ftP2 WPP ft 1 66.439 15.207 27.521 1.00 34.39 XX 

HETATM 136 ftW? WPP ft 1 71.820 11.224 23.353 1.00 13.63 XX 

HETftTM 137 ftca WPP ft 1 71.104 11.316 22.200 1.00 12.41 XX 

HETATM 139 AW 7 WPP ft 1 71.758 10.835 21.161 i.oo 15.71 XX 

40 HETATM 139 ftC? WPP ft 1 72.933 10.313 21.710 1.00 16.17 XX 

HETftTM HP AC6 WPP ft 1 74.P?3 a. 657 21.140 i.oo ai.as XX 

HETftTM 141 ftN6 WPP ft 1 74.16? 9.464 19.819 i.oo i?.S9 XX 

HETftTM 142 AMI WPP A 1 75,078 9.280 21.942 1.00 17.56 XX 

HETATM 143 AC2 WPP A 1 74.971 9.578 23.251 1.00 15.44 XX 

45 HETATM 144 A»3 WPP A 1 74.027 10.302 23.889 i.oo ?4 B9 XX 
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HBTATM 145 M< HDPA 1 73.036 10.653 73.047 1.00 17. 4B XX 

WBTATM 146 HP HDP A 1 64.183 13.106 77101 1.00 25.47 H 

KF.TATM 147 WOl WPP A J 63.142 14.169 77.253 1.00 2B.fi9 N. 

CT1TATM 148 WQ2 WPP A 1 $4,837 12,643 7,8,492 1 , OQ 24,32 , M 

5 HWTATM 149 WW1 WPP A 1 60,598 9.775 77.109 1.00 23.63 H 

KF.TATM 150 WC2 WPP A 1 60.143 10.905 26.442-99.00 78.36 K 

KF.TATM 151 WC3 WPP A 1 59.070 11.648 27.007-99.00100.00 H 

KF.TATM 152 WC7 WPP A 1 58.497 13.017 26.528-99.00100.00 H 

HETATM 153 WQ7 WPP A 1 59,358 13.7Q3 25 . 972~99 , PP1PP . PP H 

10 hktatm 154 WN7 wpp a i 57.2P7 13.4P9 26.912~99.PP 84.38 H 

KF.TATM 155 WC4 WPP A 1 58.442 11.146 28.137-99.00100.00 M 

KF.TATM 156 WC5 WPP A 1 58.912 9.963 28.754-99.00100.00 N. 

HETATM 157 MC6 WPP A 1 59,951 9.266 28.147-99.00100.00 H 

ATOM 158 N LYS A 3 76.227 -5.632 44.315 1.00 61.49 N
ATOM 159 CA LYS A 3 76.152 -4.302 43.684 1.00 58.00 C
ATOM 160 C LYS A 3 75.985 -4.421 42.171 1.00 52.79 C
ATOM 161 O LYS A 3 76.921 -4.737 41.419 1.00 44.76 O
ATOM 162 CB LYS A 3 77.359 -3.417 44.030 1.00 59.74 C
ATOM 163 CG LYS A 3 77.011 -1.944 44.314 1.00 50.87 C
ATOM 164 CD LYS A 3 78.208 -1.161 44.894 1.00 61.21 C
ATOM 165 CE LYS A 3 77.855 -0.377 46.186 1.00100.00 C
ATOM 166 NZ LYS A 3 78.857 -0.401 47.343 1.00 70.61 N
ATOM 167 N GLN A 4 74.746 -4.242 41.747 1.00 45.15 N
ATOM 168 CA GLN A 4 74.408 -4.326 40.347 1.00 37.18 C
ATOM 169 C GLN A 4 74.983 -3.166 39.561 1.00 34.93 C
ATOM 170 O GLN A 4 75.127 -2.050 40.087 1.00 28.48 O
ATOM 171 CB GLN A 4 72.915 -4.44? 40.221 1.00 34.65 C
ATOM 172 CG GLN A 4 72.456 -5.854 40.584 1.00 31.82 C
ATOM 173 CD GLN A 4 72.570 -6.788 39.405 1.00 79.25 C
ATOM 174 OE1 GLN A 4 72.165 -6.452 38.286 1.00100.00 O
ATOM 175 NE2 GLN A 4 73.206 -7.925 39.623 1.00 80.24 N
ATOM 176 N ARG A 5 75.475 -3.495 38.375 1.00 27.16 N
ATOM 177 CA ARG A 5 76.146 -2.546 37.483 1.00 39.16 C
ATOM 178 C ARG A 5 75.191 -2.018 36.433 1.00 38.22 C
ATOM 179 O ARG A 5 74.938 -2.698 35.438 1.00 32.44 O
ATOM 180 CB ARG A 5 77.398 -3.163 36.826 1.00 41.76 C
ATOM 181 CG ARG A 5 78.692 -2.954 37.663 1.00 37.34 C
ATOM 182 CD ARG A 5 80.015 -3.236 36.876 1.00 32.99 C
ATOM 183 NE ARG A 5 81.036 -2.203 37.125 1.00 25.71 N
ATOM 184 CZ ARG A 5 81.617 -1.488 36.169 1.00 32.53 C
ATOM 185 NH1 ARG A 5 81.293 -1.704 34.904 1.00 40.07 N
ATOM 186 NH2 ARG A 5 82.516 -0.551 36.474 1.00100.00 N
ATOM 187 N VAL A 6 74.743 -0.773 36.659 1.00 32.08 N
ATOM 188 CA VAL A 6 73.715 -0.082 35.881 1.00 28.89 C
ATOM 189 C VAL A 6 74.161 1.0?1 34.897 1.00 29.37 C
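Because the records in this listing are printed as whitespace-separated fields rather than in fixed PDB columns, a reader wishing to process them needs a tolerant parser. The following is a minimal sketch under stated assumptions: fields are separated by whitespace, water HETATM records carry no chain identifier, chain identifiers are never purely numeric, and the occupancy and temperature factor may be fused (for example `1.00100.00`). The function name `parse_listing_atom` and the dictionary keys are hypothetical; records in which a coordinate is fused with the occupancy (for example `26.442-99.00`) are not handled.

```python
import re

# Occupancy and B-factor printed without a separating space, e.g. "1.00100.00".
_FUSED = re.compile(r"^(-?\d+\.\d{2})(\d+\.\d{2})$")


def parse_listing_atom(line: str) -> dict:
    """Parse one whitespace-separated ATOM/HETATM record as printed in the listing."""
    tokens = line.split()
    record, serial, name, res_name = tokens[0], int(tokens[1]), tokens[2], tokens[3]
    # Water records in the listing omit the chain identifier (assumption stated above).
    if tokens[4].isdigit():
        chain, pos = "", 4
    else:
        chain, pos = tokens[4], 5
    res_seq = int(tokens[pos])
    x, y, z = (float(t) for t in tokens[pos + 1:pos + 4])
    rest = tokens[pos + 4:]
    fused = _FUSED.match(rest[0])
    if fused:
        occupancy, b_factor = float(fused.group(1)), float(fused.group(2))
        tail = rest[1:]
    else:
        occupancy, b_factor = float(rest[0]), float(rest[1])
        tail = rest[2:]
    return {
        "record": record, "serial": serial, "name": name, "res_name": res_name,
        "chain": chain, "res_seq": res_seq, "x": x, "y": y, "z": z,
        "occupancy": occupancy, "b_factor": b_factor,
        "element": tail[0] if tail else "",
    }


atom = parse_listing_atom("ATOM 165 CE LYS A 3 77.855 -0.377 46.186 1.00100.00 C")
water = parse_listing_atom("HETATM 56 O HOH 68 53.638 -16.457 32.292 1.00 31.71 O")
print(atom["res_name"], atom["b_factor"], water["res_seq"])
```

A fixed-column parser would be preferable for a true PDB file; the whitespace approach is chosen here only because the printed listing does not preserve column positions.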
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ATOM 280 O ILE A 10 68.018 7.530 38.820 1.00 20.73 2 

ATOM 281 CB ILE A 19 66.804 7.079 35.710 1.00 17.58 C
ATOM 282 CG1 ILE A 19 65.444 6.812 35.162 1.00 10.09 C
ATOM 283 CG2 ILE A 19 67.309 5.666 36.133 1.00 21.60 C
ATOM 284 CD1 ILE A 19 65.528 6.361 33.741 1.00 19.95 C
ATOM 285 N ARG A 20 68.984 6.771 37.198 1.00 18.13 N
ATOM 286 CA ARG A 20 70.286 8.897 37.836 1.00 29.25 C
ATOM 287 C ARG A 20 70.231 9.491 39.242 1.00 30.62 C
ATOM 288 O ARG A 20 70.957 9.991 40.129 1.00 33.99 O
ATOM 289 CB ARG A 20 71.201 9.713 36.957 1.00 11.71 C
ATOM 290 CG ARG A 20 72.619 9.781 37.419 1.00 23.79 C
ATOM 291 CD ARG A 20 72.881 11.107 38.060 1.00 36.76 C
ATOM 292 NE ARG A 20 74.297 11.443 38.062 1.00 48.34 N
ATOM 293 CZ ARG A 20 74.990 11.841 36.988 1.00100.00 C
ATOM 294 NH1 ARG A 20 74.393 11.931 35.808 1.00100.00 N
ATOM 295 NH2 ARG A 20 76.289 12.139 37.076 1.00100.00 N
ATOM 296 N ARG A 21 69.368 10.461 39.439 1.00 22.10 N
ATOM 297 CA ARG A 21 69.216 11.052 40.750 1.00 17.45 C
ATOM 298 C ARG A 21 68.721 10.007 41.730 1.00 26.71 C
ATOM 299 O ARG A 21 69.147 10.001 42.885 1.00 30.27 O
ATOM 300 CB ARG A 21 68.142 12.144 40.708 1.00 17.93 C
ATOM 301 CG ARG A 21 68.682 13.522 40.321 1.00 27.57 C
ATOM 302 CD ARG A 21 67.586 14.599 40.130 1.00 23.02 C
ATOM 303 NE ARG A 21 67.619 15.099 38.743 1.00 55.12 N
ATOM 304 CZ ARG A 21 66.538 15.103 37.995 1.00 10.55 C
ATOM 305 NH1 ARG A 21 65.343 14.974 38.552 1.00 29.80 N
ATOM 306 NH2 ARG A 21 66.665 15.435 36.715 1.00 61.45 N
ATOM 307 N GLN A 22 67.713 9.223 41.345 1.00 27.48 N
ATOM 308 CA GLN A 22 67.167 8.257 42.31? 1.00 21.79 C
ATOM 309 C GLN A 22 68.137 7.127 42.547 1.00 31.37 C
ATOM 310 O GLN A 22 68.394 6.724 43.685 1.00 27.47 O
ATOM 311 CB GLN A 22 65.818 7.706 41.894 1.00 17.11 C
ATOM 312 CG GLN A 22 64.921 8.745 41.243 1.00 66.11 C
ATOM 313 CD GLN A 22 63.425 8.456 41.397 1.00 41.27 C
ATOM 314 OE1 GLN A 22 63.002 7.329 41.762 1.00 29.31 O
ATOM 315 NE2 GLN A 22 62.610 9.464 41.046 1.00 20.12 N
ATOM 316 N LEU A 23 68.697 6.652 41.448 1.00 27.99 N
ATOM 317 CA LEU A 23 69.649 5.575 41.500 1.00 24.48 C
ATOM 318 C LEU A 23 70.928 5.971 42.334 1.00 28.97 C
ATOM 319 O LEU A 23 71.288 5.218 43.165 1.00 30.79 O
ATOM 320 CB LEU A 23 70.036 5.107 40.989 1.00 22.72 C
ATOM 321 CG LEU A 23 68.966 4.072 39.658 1.00 26.16 C
ATOM 322 CD1 LEU A 23 69.271 3.083 38.191 1.00 24.80 C
ATOM 323 CD2 LEU A 23 68.427 3.284 40.835 1.00 22.91 C
ATOM 324 N GLU A 24 71.279 7.192 42.153 1.00 28.77 N
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26 


65.968 0.017 43.771 1.00 77.32 N
ATOM 353 N GLY A 27 72.778 2.114 47.266 1.00 46.30 N
ATOM 354 CA GLY A 27 74.060 1.531 47.549 1.00 46.82 C
ATOM 355 C GLY A 27 74.140 0.165 46.923 1.00 55.45 C
ATOM 356 O GLY A 27 75.204 -0.153 46.877 1.00 64.43 O
ATOM 357 N ASP A 28 73.017 -0.315 46.428 1.00 40.98 N
ATOM 358 CA ASP A 28 73.016 -1.647 45.861 1.00 ?0.35 C
ATOM 359 C ASP A 28 73.266 -1.535 44.100 1.00 39.55 C
ATOM 360 O ASP A 28 73.109 -2.518 43.654 1.00 48.80 O
ATOM 361 CB ASP A 28 71.680 -2.335 46.127 1.00 47.80 C
ATOM 362 CG ASP A 28 70.50? -1.373 46.064 1.00 35.34 C
ATOM 363 OD1 ASP A 28 70.705 -0.140 46.095 1.00 39.23 O
ATOM 364 OD2 ASP A 28 69.383 -1.870 45.872 1.00 69.86 O
ATOM 365 N VAL A 29 73.651 -0.329 43.996 1.00 31.03 N
ATOM 366 CA VAL A 29 73.88? -0.050 42.591 1.00 28.44 C
ATOM 367 C VAL A 29 75.166 0.676 42.291 1.00 2?.99 C
ATOM 368 O VAL A 29 75.505 1.699 42.892 1.00 34.83 O
ATOM 369 CB VAL A 29 72.696 0.760 42.000 1.00 39.69 C
ATOM 370 CG1 VAL A 29 72.935 1.088 40.519 1.00 23.65 C
ATOM 371 CG2 VAL A 29 71.416 -0.028 42.156 1.00 27.95 C
ATOM 372 N GLU A 30 75.824 0.219 41.239 1.00 30.76 N
ATOM 373 CA GLU A 30 76.99? 0.924 40.736 1.00 28.38 C
ATOM 374 C GLU A 30 76.678 1.471 39.332 1.00 31.03 C
ATOM 375 O GLU A 30 76.368 0.720 38.397 1.00 26.64 O
ATOM 376 CB GLU A 30 78.199 0.006 40.722 1.00 31.84 C
ATOM 377 CG GLU A 30 79.355 0.539 41.533 1.00 89.26 C
ATOM 378 CD GLU A 30 80.667 0.264 40.858 1.00100.00 C
ATOM 379 OE1 GLU A 30 81.082 -0.922 40.872 1.00 88.94 O
ATOM 380 OE2 GLU A 30 81.202 1.206 40.219 1.00100.00 O
ATOM 381 N LEU A 31 76.665 2.789 39.207 1.00 22.24 N
ATOM 382 CA LEU A 31 76.269 3.391 37.945 1.00 29.37 C
ATOM 383 C LEU A 31 77.404 3.507 36.941 1.00 25.79 C
ATOM 384 O LEU A 31 78.485 3.969 37.256 1.00 29.41 O
ATOM 385 CB LEU A 31 75.632 4.760 38.191 1.00 30.20 C
ATOM 386 CG LEU A 31 74.329 4.763 38.994 1.00 29.37 C
ATOM 387 CD1 LEU A 31 73.841 6.143 39.240 1.00 23.43 C
ATOM 388 CD2 LEU A 31 73.275 3.962 38.281 1.00 23.04 C
ATOM 389 N VAL A 32 77.146 3.100 35.711 1.00 21.94 N
ATOM 390 CA VAL A 32 78.143 3.265 34.685 1.00 25.48 C
ATOM 391 C VAL A 32 77.535 4.242 33.669 1.00 38.76 C
ATOM 392 O VAL A 32 76.429 3.999 33.180 1.00 29.70 O
ATOM 393 CB VAL A 32 78.517 1.902 34.055 1.00 34.25 C
ATOM 394 CG1 VAL A 32 79.587 2.079 32.970 1.00 30.56 C
ATOM 395 CG2 VAL A 32 79.003 0.950 35.139 1.00 25.27 C
ATOM 396 N LEU A 33 78.219 5.375 33.457 1.00 30.19 N
ATOM 397 CA LEU A 33 77.732 6.463 32.621 1.00 22.71 C
ATOM 398 C LEU A 33 78.727 6.979 31.645 1.00 29.55 C
ATOM 399 O LEU A 33 79.896 7.152 31.988 1.00 30.09 O
ATOM 400 CB LEU A 33 77.123 7.635 33.514 1.00 19.75 C
ATOM 401 CG LEU A 33 76.729 7.200 34.779 1.00 19.38 C
ATOM 402 CD1 LEU A 33 76.814 8.344 35.762 1.00 27.24 C
ATOM 403 CD2 LEU A 33 75.271 6.913 34.444 1.00 22.07 C
ATOM 404 N ARG A 34 78.239 7.121 30.496 1.00 15.09 N
ATOM 405 CA ARG A 34 79.154 8.008 29.541 1.00 26.04 C
ATOM 406 C ARG A 34 78.469 9.173 28.916 1.00 36.57 C
ATOM 407 O ARG A 34 77.288 9.130 28.651 1.00 38.59 O
ATOM 408 CB ARG A 34 79.486 7.04? 28.398 1.00 22.89 C
ATOM 409 CG ARG A 34 80.579 6.091 28.70? 1.00 23.29 C
ATOM 410 CD ARG A 34 81.370 6.575 29.860 1.00 52.06 C
ATOM 411 NE ARG A 34 81.783 5.158 30.711 1.00 80.25 N
ATOM 412 CZ ARG A 34 82.646 4.530 30.323 1.00 41.94 C
ATOM 413 NH1 ARG A 34 83.173 4.596 29.104 1.00 53.02 N
ATOM 414 NH2 ARG A 34 82.983 3.517 31.148 1.00 25.56 N
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ATOM 415 M TOR A 35 79 ?4» 10.156 28.539 1.00 31.58 H 

ATOM 416 CA THR A 35 78.703 11.282 27.833 1.00 2?.33 C
ATOM 417 C THR A 35 78.719 10.951 26.310 1.00 32.53 C
ATOM 418 O THR A 35 79.359 9.944 25.962 1.00 28.08 O
ATOM 419 CB THR A 35 79.527 12.527 28.145 1.00 37.49 C
ATOM 420 OG1 THR A 35 80.844 12.429 27.560 1.00 31.91 O
ATOM 421 CG2 THR A 35 79.627 12.642 29.651 1.00 19.38 C
ATOM 422 N ARG A 36 78.032 11.780 25.529 1.00 30.02 N
ATOM 423 CA ARG A 36 78.002 11.639 24.056 1.00 29.37 C
ATOM 424 C ARG A 36 79.106 11.765 23.503 1.00 31.4? C
ATOM 425 O ARG A 36 79.772 11.012 22.591 1.00 36.56 O
ATOM 426 CB ARG A 36 77.054 12.650 23.354 1.00 37.31 C
ATOM 427 CG ARG A 36 76.937 12.465 21.846-99.00 49.47 C
ATOM 428 CD ARG A 36 78.020 13.515 21.232-99.00 63.09 C
ATOM 429 NE ARG A 36 75.528 13.124 19.915-99.00 75.23 N
ATOM 430 CZ ARG A 36 74.381 13.549 19.391-99.00 91.44 C
ATOM 431 NH1 ARG A 36 73.605 14.375 20.079-99.00 79.32 N
ATOM 432 NH2 ARG A 36 74.009 13.144 18.185-99.00 78.73 N
ATOM 433 N ASP A 37 80.217 12.677 24.063 1.00 41.30 N
ATOM 434 CA ASP A 37 81.606 12.710 23.601 1.00 44.91 C
ATOM 435 C ASP A 37 82.410 11.481 24.043 1.00 24.99 C
ATOM 436 O ASP A 37 83.211 10.978 23.261 1.00 42.22 O
ATOM 437 CB ASP A 37 82.347 14.048 23.718-99.00 47.07 C
ATOM 438 CG ASP A 37 81.881 14.887 24.876-99.00 62.99 C
ATOM 439 OD1 ASP A 37 80.679 14.839 25.204-99.00 64.45 O
ATOM 440 OD2 ASP A 37 82.711 15.638 25.429-99.00 69.84 O
ATOM 441 N GLU A 38 82.129 10.950 25.235 1.00 19.39 N
ATOM 442 CA GLU A 38 82.790 9.717 25.682 1.00 27.84 C
ATOM 443 C GLU A 38 82.203 8.527 24.901 1.00 37.14 C
ATOM 444 O GLU A 38 82.873 7.511 24.699 1.00 35.04 O
ATOM 445 CB GLU A 38 82.??? 9.435 27.207 1.00 25.18 C
ATOM 446 CG GLU A 38 83.116 10.549 28.183 1.00 37.45 C
ATOM 447 CD GLU A 38 82.807 10.212 29.655 1.00 21.13 C
ATOM 448 OE1 GLU A 38 81.623 9.997 30.014 1.00 55.97 O
ATOM 449 OE2 GLU A 38 83.757 9.978 30.419 1.00 98.78 O
ATOM 450 N LEU A 39 80.948 8.610 24.478 1.00 25.52 N
ATOM 451 CA LEU A 39 80.440 7.483 23.739 1.00 18.17 C
ATOM 452 C LEU A 39 79.291 7.764 22.825 1.00 20.34 C
ATOM 453 O LEU A 39 78.152 7.810 23.259 1.00 26.35 O
ATOM 454 CB LEU A 39 80.123 6.313 24.657 1.00 14.56 C
ATOM 455 CG LEU A 39 79.410 5.075 24.058 1.00 19.52 C
ATOM 456 CD1 LEU A 39 80.205 4.392 22.994 1.00 18.84 C
ATOM 457 CD2 LEU A 39 78.890 4.051 25.084 1.00 12.?? C
ATOM 458 N ASN A 40 79.598 7.800 21.543 1.00 16.73 N
ATOM 459 CA ASN A 40 78.548 7.971 20.540 1.00 21.55 C
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him.. 


16P 


_.C. 


A5H A 


40 


77,798. 


6.649 


20.308 


-1.00 24.53 






ATOM 461 O ASN A 40 78.328 5.729 19.696 1.00 19.96 O
ATOM 462 CB ASN A 40 79.130 8.367 19.216 1.00 18.45 C
ATOM 463 CG ASN A 40 78.054 8.727 18.225 1.00 42.19 C
ATOM 464 OD1 ASN A 40 78.327 9.093 17.080 1.00 38.89 O
ATOM 465 ND2 ASN A 40 76.827 8.730 18.697 1.00 23.71 N
ATOM 466 N LEU A 41 76.543 6.622 20.754 1.00 21.08 N
ATOM 467 CA LEU A 41 75.649 5.465 20.650 1.00 15.03 C
ATOM 468 C LEU A 41 75.225 5.068 19.21? 1.00 18.22 C
ATOM 469 O LEU A 41 74.681 3.971 18.980 1.00 15.72 O
ATOM 470 CB LEU A 41 74.426 5.705 21.532 1.00 15.85 C
ATOM 471 CG LEU A 41 74.822 6.02? 22.974 1.00 21.90 C
ATOM 472 CD1 LEU A 41 73.601 6.113 23.749 1.00 20.59 C
ATOM 473 CD2 LEU A 41 75.181 4.796 23.609 1.00 17.97 C
ATOM 474 N LEU A 42 75.512 5.916 18.238 1.00 12.45 N
ATOM 475 CA LEU A 42 75.256 5.607 16.831 1.00 15.99 C
ATOM 476 C LEU A 42 76.290 4.680 16.280 1.00 26.18 C
ATOM 477 O LEU A 42 76.966 4.039 15.257 1.00 22.41 O
ATOM 478 CB LEU A 42 75.282 6.873 15.984 1.00 17.85 C
ATOM 479 CG LEU A 42 74.180 7.854 16.399 1.00 30.70 C
ATOM 480 CD1 LEU A 42 74.318 9.184 15.704 1.00 24.31 C
ATOM 481 CD2 LEU A 42 72.764 7.211 16.209 1.00 31.13 C
ATOM 482 N ASP A 43 77.462 4.705 16.911 1.00 26.87 N
ATOM 483 CA ASP A 43 78.579 3.875 16.486 1.00 19.29 C
ATOM 484 C ASP A 43 78.58? 2.51? 17.163 1.00 13.33 C
ATOM 485 O ASP A 43 79.951 2.318 18.297 1.00 18.75 O
ATOM 486 CB ASP A 43 79.870 4.580 16.776 1.00 31.06 C
ATOM 487 CG ASP A 43 81.083 3.758 16.380 1.00 30.68 C
ATOM 488 OD1 ASP A 43 80.971 2.551 16.082 1.00 32.36 O
ATOM 489 OD2 ASP A 43 82.187 4.308 16.499 1.00 37.83 O
ATOM 490 N SER A 44 78.139 1.544 16.377 1.00 16.89 N
ATOM 491 CA SER A 44 77.978 0.173 16.799 1.00 17.67 C
ATOM 492 C SER A 44 79.237 -0.163 17.392 1.00 20.40 C
ATOM 493 O SER A 44 79.206 -1.126 19.444 1.00 26.27 O
ATOM 494 CB SER A 44 77.501 -0.617 15.591 1.00 13.85 C
ATOM 495 OG SER A 44 76.800 -1.740 16.063 1.00 43.83 O
ATOM 496 N ARG A 45 80.335 -0.301 16.682 1.00 15.63 N
ATOM 497 CA ARG A 45 81.616 -0.788 17.154 1.00 19.94 C
ATOM 498 C ARG A 45 81.910 -0.225 18.521 1.00 29.48 C
ATOM 499 O ARG A 45 82.241 -0.937 19.457 1.00 27.65 O
ATOM 500 CB ARG A 45 82.684 -0.261 16.203 1.00 27.46 C
ATOM 501 CG ARG A 45 83.46? -1.338 15.495 1.00 92.03 C
ATOM 502 CD ARG A 45 84.854 -1.419 16.077 1.00100.00 C
ATOM 503 NE ARG A 45 85.636 -2.533 15.527 1.00100.00 N
ATOM 504 CZ ARG A 45 86.092 -3.570 16.236 1.00100.00 C
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*™m 505 mhi abg a 45 -3,695 17 .547 1.00100.0° II 

ATOM 506 MH7 ABB A 45 ft 6, 773 -4.544 15.642 1.00100. OP H 

ATOM 507 M AT.A A 46 81.772 1.090 19.629 1.00 31.04 H 

atom soa rA ata a 46 82.045 L212 19.861 1.00 24.72 £ 

5 atom 509 c ata a 46 Bi.in 1,176 20.89? 1.00 17.73 : £ 

»Tm 5io o aia a 46 81.512 0.825 22.027 1.00 22.73 £2 

atom 511 cb aia a 46 81.839 3.221 19.751 1.00 27.16 £ 

at™ 512 k vat, a 47 79.B35 1.119 20.531 1.00 17.51 H 

ATflM 513 CA VAT. A 47 78.878 fc^flfl 21.508 1.00 21.41 £ 

10 ATOM 514 C VAIi A 47 79.262 -0.812 21.914 1.00 30.25 £ 

ATOM 515 O VAX A 47 79 .192 -1.202 23.097 1.00 15.95 Q 

atom 516 cb vax a 47 77.470 0.668 20.989 1.00 18.59 E 

ATOM 517 CGI VAIi A 47 76.503 Q.Q« 22.012 ItOO 16, B8 £ 

ATOM 518 CG2 VAL A 47 21x115 2 , Q96 20,756 1.00 16.28 £ 

15 ATOM 519 M HTS A 48 79,692 -1.S8S 20.920 1.00 21.00 H 

ATOM 520 CA HIS A 48 80.028 -2.969 21.192 1.00 20.17 £ 

ATOM 521 C HIS A 48 81.268 -3.079 22.117 1.00 32.98 £ 

ATOM 522 O HIS A 48 81.289 -3.BS0 23.102 1.00 28.20 Q 

atom 523 cb his a 48 80.063 -3.801 19.855 1.00 14.93 £ 

20 ATOM S24 CG HIS A 48 78.686 -4.172 19.338 1.00 26.67 £ 

ATOM S25 MD1 HTS A 48 78.0BS -5.394 19.600 1.00 28.83 II 

atom S26 cd2 hts a 48 77.758 -3.448 18.659 1.00 25.56 £ 

atom 527 cei hts a 48 76,887 -5.430 19.043 1.00 20.08 £ 

ATOM 528 WE2 HIS A 4ft 76.660 -4.260 18.475 1.00 25.22 H 

25 ATOM 529 M ASP A 49 82.217 -2.170 21.902 1.00 22.62 H 

atom 530 ca asp a 49 83.455 -2.169 22.674 1.00 24.23 £ 

ATOM 531 C ASP A 49 83.171 -1.899 24.122 1.00 38.72 £ 

532 o ftsp A 4? 83.70? -2,551 25.027 1,PP 35.44 Q 

AI2M 533 cb asp a 49 84.396 -l.u? 22.127 1.00 30.29 £ 

30 ATOM 534 CG ASP A 49 84.991 -1.503 20.775 1.00 52.45 £ 

ATOM 535 OD1 ASP A 49 85.007 -2.726 20.449 1.00 42.67 2 

ATOM 536 OD2 ASP A 49 85.416 -Q,587 20.Q29 1.P0 73,76 C 

ATOM 537 H PHE A 50 82.294 -0.929 24.324 1.00 32.19 H 

ATOM 538 CA PHE A SO 81.902 -0.550 75.649 1.00 29.76 £ 

35 atom 539 c phe a so 81.299 -1.765 26.359 l.PO 3P.31 £ 

atom 540 o phe a 50 81.715 -2.124 27.449 l.PO 29.22 Q 

atom 541 cb phe a so 80.892 0.610 25.576 1.00 23.82 £ 

atom 542 cg phe a 50 80,137 0,843 26.859 1.00 19.13 £ 

atom S43 cm phe a so 8P.74P 1.515 27.931 1.00 20.14 £ 

40 ATOM 544 CD2 PHE A SO 78.835 P. 360 27.018 1.00 13.99 £ 

ATOM 545 CEI PHE A SO 80.034 1,742 29. I?* IrPO 25.81 £ 

atom 54 6 ce2 phe a so 78.114 P.553 28.212 1.PP 22.84 £ 

atom 547 c?. phe a so 78.698 1.276 29.259 l.PP 23.40 £ 

ATOM S4B M PHE A 51 80.280 -2.367 75.768 l.PP 21.75 H 

45 ATOM 549 CA PHE A 51 79.655 -3.451 76.457 1.00 22.61 £ 
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him 550 C PHE A 51 BO. 646 -4.603 26.61? I.nn 1I.B1 £ 

&iai 551 O PHE A 51 80.S50 -5.401 27.590 1.00 25. 2B £ 

hXSSl 552 CB PHE A SI 78.389 -3.999 25.751 1.00 22.63 £ 

AICM 553 CG PHE A 51 77.158 -3.140 26.170 1.00 27.58 £ 

5 him 551 CB1 PHE A 5J 76.426 -3.525 27.280 1.00 21.78 £ 

him 555 £P2 PHE ft 52 76,66? -2.199 25. MP 1.0P 19.55 C 

him 556 CE1 PHE A 51 75.267 -2.796 27.662 1.00 28.34 £ 

him 557 CE2 PHE ft 51 75.492 -1.403 25.734 1.00 14.47 £ 

ATOM 558 CZ P HE A 51 74.797 -1.744 26.878 1.00 14.S5 £ 

10 him 559 N ftLft ft 52 81.576 -4.706 25.659 1.00 26.43 H 

him 560 CA ALA A 52 82.587 -5.793 25.714 1.00 29.44 £ 

him 561 C ALA A 52 83.687 -5.560 26.768 1.00 43.76 £ 

him 562 2 AAA ft 5? 84.502 -6.446 27.022 1.00 40.33 £ 

him 563 CB MA ft 52 83.228 -6.049 24.344 1.00 24.2S £ 

15 him 561 V SERA S3 83.702 -4.382 27. 385 1.00 31.96 

him 565 CA SEP. ft 53 84.705 -4.090 28.377 1.00 21.06 £ 

him 566 C SEE A 5? 64.196 -3.625 29.709 1.00 26.41 £ 

him 567 P SER ft 53 84.985 -3.492 30.611 1.00 36.12 £ 

him 568 CB SER ft 53 85.709 -3.088 27.843 1.00 14.22 £ 

20 him 56? PG SER A 53 85.140 -1.807 27.790 1.00 56.90 £ 

him 570 N GLD A 54 82.892 -3.431 29.874 1.00 22.38 ft 

him 571 CA ffiy A 51 82.380 -2.893 31.139 1.00 17.27 £ 

him 572 C CLP A 51 81.584 -3.735 32.118 1.00 26.32 £ 

him 573 p gIP A 51 81.229 -3.281 33.191 1.00 37.43 £ 

25 him 571 CB CLP A 51 81.677 -1.563 30.906 1.00 27.30 £ 

him 575 Cg CLP A 51 82.573 -P.513 30.262 1,PP 11,77 £ 

him 576 CP CLP A 51 83.669 -0.142 31.194 1.00 86.31 £ 

him 577 OE1 GLD A 54 83.392 -0.232 32.428 1.00 50.11 £ 

him 578 PE2 PEP A 51 81,785 0.198 30.692 l.OO S0.99 £ 

30 him 579 W ARC A 55 81.268 -4.971 31.804 1.00 29.63 H 

aiSM 58P CA Affg A 55 80.636 -5.748 32.854 1.00 33.32 £ 

him 581 C ARg A 55 79.347 -5.149 33.378 1.00 38.45 £ 

him 582 P ARG A 55 79.214 -4.897 34.576 1.00 40.1B £ 

him 583 CB ARC A 55 81.621 -5.875 34.045 1.00 57.61 £ 

35 him 581 Cg ARg A 55 B2.666 -7.028 33.960 1.00100.00 £ 

him 585 CP ARC A 55 82.805 -7. 805 35.305 1 .00100.00 £ 

him 586 ME ARG A SS 82.838 -9.270 35.146 1.00100.00 ft 

him 587 C8 ARg A 55 83.206 -10.129 36.102 1.00100.00 £ 

him 588 NH1 ARG A 55 83.583 -9.681 37.301 1.00100.00 JJ 

40 him 589 NH? ARG A SB 83.208 -11.440 35.855 1.00100.00 JJ 

him 5_2P_n HE A 56 78,367 -5.P29 32.191 1,PP 12.25 U 

him 59 1 CA TLB A 56 77.064 -4.434 32.794 1.00 25.49 £ 

him 592 C ILE A 56 75.982 -5.474 33.244 1.00 20.18 £ 

him 593 O ILE A 56 75.897 -6.579 32.704 1.00 24.74 £ 

45 him 594 CB ILE A 56 76.672 -3.512 31.531 1.00 26.89 £ 
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ATOM 595 cr.1 tt.f. A 56 77.643 -2.301 31.442 1.00 18.30 £ 

ATOM 596 Cft? TT.R A 56 75,214 -3,916 31,549 1,1)1) 19,64 £ 

atom 597 cm tt.b a 56 77.998 -1.936 30.026 i.pp go, 42 £ 

ATOM 598 W ASP A 57 75.166 -5.133 34.237 1.00 16.84 ft 

5 ATOM 599 CA ASP A 57 74.040 '5. 999 34,630 1.Q0 16,33 £ 

atom 600 c asp a 57 72.676 -5.451 34.123 1.00 28.40 £ 

atom 601 O ASP A S7 71.836 -6.198 33.657 1.00 25.50 £ 

atom 602 cb asp a 57 74,009 -6.194 36.164 1.00 16.94 C 

atom 603 cs asp a 57 75.369 -6.720 36.703 1.00 34.27 £ 

10 atom 604 opi asp a 57 75.875 -7.729 36.141 1.00 31.76 2 

ATOM 605 OD2 ASP A 57 76.040 -6.007 37.499 1.00 28.36 £ 

ATOM 606 W GLW A 58 72.443 -4.152 34.220 1.00 28.91 H 

ATOM 607 CA GLW A 58 71.183 -3.590 33.755 1.00 25.68 £ 

atom 608 c SIM a 58 71.425 -2.364 32. Ml 1.00 23.21 £ 

15 ATOM 609 O GLW A 58 72.403 -1.620 33.067 1.00 18.16 £ 

ATOM 610 CB CLW A 58 70.342 -3.151 34.946 1.00 33.14 £ 

atom 611 cs glw a 58 69.798 -4,341 35.80? 1.00 30,00 £ 

atom 612 cd glw a 58 69.226 -3.712 37.105 1.00 27.18 £ 

ATOM 613 OBI GLW A SB 68.722 -2.601 37.161 1.00 31.20 fi " 

20 ATOM 614 NE2GLWA 58 69.455 -4.436 38.186 1.00 16.89 H ' 

ATOM 615 N VAL A 59 70.496 -2.138 31.961 1.00 18.35 N

ATOM 616 CA VAL A 59 70.562 -0.998 31.045 1.00 15.59 C

ATOM 617 C VAL A 59 69.238 -0.240 31.039 1.00 26.28 C

ATOM 618 O VAL A 59 68.178 -0.820 30.762 1.00 19.51 O

ATOM 619 CB VAL A 59 70.707 -1.456 29.601 1.00 15.32 C

ATOM 620 CG1 VAL A 59 70.47? -0.274 28.649 1.00 11.93 C

ATOM 621 CG2 VAL A 59 72.080 -2.111 29.364 1.00 15.83 C

ATOM 622 N TYR A 60 69.306 1.064 31.293 1.00 31.71 N

ATOM 623 CA TYR A 60 6P.11? 1.927 31.197 1.00 21.40 C

ATOM 624 C TYR A 60 68.289 2.756 29.928 1.00 18.69 C

ATOM 625 O TYR A 60 69.250 3.532 29.796 1.00 15.51 O

ATOM 626 CB TYR A 60 68.021 2.817 32.413 1.00 17.24 C

ATOM 627 CG TYR A 60 67.493 2.131 33.658 1.00 19.71 C

ATOM 628 CD1 TYR A 60 68.345 1.583 34.586 1.00 21.14 C

ATOM 629 CD2 TYR A 60 66.154 2.22? 33.991 1.00 20.16 C

ATOM 630 CE1 TYR A 60 67.835 1.080 35.794 1.00 19.11 C

ATOM 631 CE2 TYR A 60 65.648 1.698 35.163 1.00 10.77 C

ATOM 632 CZ TYR A 60 66.476 1.094 36.054 1.00 20.07 C

ATOM 633 OH TYR A 60 65.921 0.585 37.248 1.00 16.04 O

ATOM 634 N LEU A 61 67.491 2.452 28.916 1.00 17.46 N

ATOM 635 CA LEU A 61 67.685 3.053 27.585 1.00 20.17 C

ATOM 636 C LEU A 61 67.003 4.412 27.409 1.00 23.36 C

ATOM 637 O LEU A 61 65.925 4.526 26.799 1.00 14.86 O

ATOM 638 CB LEU A 61 67.g67 2.060 26.485 1.00 14.78 C

ATOM 639 CG LEU A 61 68.117 2.142 25.208 1.00 15.52 C

ATOM 640 CD1 LEU A 61 67.815 1 nio 24.109 1.00 7.75 C

ATOM 641 CD2 LEU A 61 68.997 3.541 24.580 1.00 is.gn C

AlfiM 642 W MA ft 6? 67.656 5 434 27.956 1.00 20.35 g 

64? Cft AIA A 62 67.120 6. 784 27.96 3 1.00 18.55 Q 

5 ftlSM 644 C AIA A 62 67.779 7.739 26.949 1.00 IB. 57 £ 

64? P MA ft 62 67.455 8.924 26.9 20 1.00 24.31 Q 

aiCM 646 CB MA ft 62 67.071 7.377 29.439 l.oo n.69 £ 

KtSU 647 w MA ft 63 68,681 7.231 26.10 1 l.oo 14.09 g 

648 Cft MA ft 63 69.249 B. Q9S 25.052 1.00 12.B4 £ 

10 aiSSI £!£_£ ftLft ft 63 68.310 B OOS 23.877 1.00 27.00 £ 

ftlCM fi£fl 2 MA ft 63 67.845 6.916 23.511 1.00 24.51 £ 

OEM 651 CB MA ft 63 70.665 7.660 24.63 4 1.00 4.B9 £ 

6£2 — U MA A 64 68,076 9.148 23.26 2 l.oo 2i.ns g 

653 Cft AIA ft 64 67.202 9.286 22.0B6 1.00 13.50 £ 

15 654 C MA A 64 67.435 10.664 21.416 1.00 2B.OB £ 

fiS5_£ AIA ft 64 67.987 11 .600 22.021 l.OO 26.63 £ 

ttISM 656 CB ftLft ft 64 65.642 9.171 22.51 8 l.oo 7.63 £ 

arcai 6 57 w — lys ft 65 66.953 io.7ai 20.182 1.00 23 .9e g 

ai£ai 658 Cft LYS ft 65 66.966 12.0 12 19.409 1 .00 20,47 E 

20 atOl »9 g LYS ft 65 65.48B 12.443 19.5S1 1.00 24.37 £ 

*X2M 660 0 LYS ft 65 64.594 11.B07 1B.976 1.00 20.29 £ 

aiSM 661 CB LYS ft 65 67.317 11.658 17.951 1.00 25.59 £ 

OEM 662 CS LYS ft 65 66.808 12.6 30 16.923 1.00 27.54 £ 

AZBl 663 CP LYS ft 65 67.518 13.926 n.169 i.nn 2in« £ 

25 664 CE LYS ft 65 67.316 14.905 16.029 1.00 55.15 £ 

AEai 665 NZ LYS A 65 67.876 16.263 16.392 1.00 81.63 H 

aEal 666 W VAL A 66 65.228 13.362 20.485 1.00 22.47 H 

*I2H 667 CA VftL ft 66 63.B73 ii.bso 20.755 l.oo ib.99 £ 

EES C VAL ft 66 63.711 15.343 20.394 1.00 31.44 £ 

30 aiSM 569 0 VAL ft 66 64.665 16.1 07 20.460 1.00 34.61 £ 

670 CB VAL ft 66 63.440 13.623 22.204 1 .00 16.66 £ 

671 Cgl VAL A 66 64.269 12.6 23 22.B69 1.00 15.01 £ 

OEM 672 PG2 VAL A 66 63.379 14.9 04 22.950 1.00 19.21 £ 

673 N ELY A 67 62.S14 IS. 755 19.994 i.on 1« 03 g 

35 ^SM 674 CA S1I ft 67 62.298 17.149 19.614 1.00 14.90 £ 

621_£ SLY A 67 60.792 17.sib 19.SB5 i.nn 32.35 £ 

676 P gLY A 67 59.922 16. 666 19.8BB 1.00 18.88 £ 

OEM 6JZ2 H gLY A 68 60.503 1B.7B7 19.256 1 .00 23.21 g 

^33tt 678 Cft H.Y ft 6B 59.132 19.2 BB 19.183 1.00 23.83 £ 

40 622__E gLY ft 68 58.540 19.137 17.771 l.OO 19.31 £ 

MEM 680 0 gLY ft 68 59.165 18.550 16.B70 1.00 30.64 £ 

MEM ?B1 N ILE ft 6? 57.343 19.684 17.5BB 1.00 15.20 g 

662 CA ILE A 69 56.595 19.632 16.317 1.00 16.80 £ 

ILE A 6? 57.387 20.153 15.112 1.00 19.33 £ 

45 684 Q ILE ft 6? 57.425 19.519 14.061 1.00 14.66 £ 
ATOM 910 CB ASN A 97 76.241 -8.201 26.46? 1.00 12.93 C

911 CS ASH ft 97 79.260 -7.897 25.407 1.00 24.91 £ 

filSM 912 PP1 ASW A 97 80.331 -a.518 25.375 1.00 57.17 £ 

AISM 913 ND2 ASM A 97 78.839 -7 135 24.392 1.00 34.88 ft 

5 £124 914 H ASP ft 98 75.666 -10.732 27.055 1.00 27.9B ft 

ftlSM 915 Cft ASP A 98 74.907 -11.361 28.089 1.00 29.25 £ 

ftTO! 91g C ftSf ft ?B 74.400 -10.379 29.164 1.00 37.53 £ 

ATOM 917 P ftSP ft 98 74.505 -10.634 30.367 1.00 36.42 Q 

ftlfiM 918 CB ftSP ft 98 75.791 -12.450 28.700 1.00 36.37 £ 

10 ftUM 919 Cg ASP ft 98 75.016 -13.712 29.053 1.00 88.62 £ 

ftlSM 92 p PP1 ASP ft 98 73.775 -13.749 28.877 1.00 82.53 £ 

a!£M 921 0P2 ASP ft 98 75.656 -14.670 29.542 1.00100.00 £ 

AISM 222 H VM. A 99 73.879 -9.235 28.730 1 .00 27 13 U 

923 Cft VftL ft 99 73.157 -8.351 29.63S 1.00 21.57 £ 

15 aiSH 224 £ VftJt ft 99 71.706 -8.868 29.530 1.00 16.15 £ 

AI2M 925 p VAX ft 99 71.159 -9.088 28.422 1.00 19.47 Q 

AI2M 326 CP VAX A 99 73.264 -6. 900 29.206 1.00 24.18 £ 

AISM 927 Cgl VAX A 99 72.517 -6.015 30.198 i.oo usa £ 

AlfiM 928 Cg2 VAX A 99 74.720 -6. S15 29.225 1.00 30.10 £ 

20 &XSM 929 H ASH A 1PP 71.149 -9.262 30.662 1.00 17.39 fl 

A1S21 239 Cft ASH A 1PP 69.852 -9.925 30.613 1.00 2S.77 £ 

AT2M 222 £ ASW A 1PP 68.648 -9.034 30.910 1.00 24.95 £ 

&I£M 932 P ASW A 100 67.498 -9.377 30.582 1 .00 20 88 £ 

AX2M 233 CP ASW ft IpQ 69.846 -11.157 31.527 1.00 14.98 £ 

25 AI2H 934 Cg ASH A 1PP 68.724 -12.112 31.180 1.00 20.38 £ 

M 935 PP1 ASH A 1PP 68.737 -12. 709 30.100 1.00 29.59 £ 

AISM 936 NP2 ASW A 1PP 67.716 -12. 240 32.076 1.00 16.35 M 

AX2M ?37 W XYS A 1P1 68.941 -7.923 31.584 1.00 17.91 ft 

AI2M 939 CA XYS A 1P1 67.970 -6.916 31.994 1.00 25.43 £ 

30 A123 939 C E-YS A 1P1 68.107 -5.510 31.323 1.00 2S.29 £ 

ATOM ilfi Q XYS A 1P1 69.151 -4. 850 31.377 1.00 19.88 £ 

ATCM m CP XYS A 1P1 67.996 -6. B07 33.521 1.00 29.28 £ 

aiOa 242 CP XYS A 1P1 67.464 -B.0S4 34.205 1.00 9.31 £ 

A12M 243 CP XYS A 1P1 67.218 -7.719 35.668 1.00 38.93 £ 

35 AISM 244 CE XYS A 1P1 66.206 -6.569 35.885 1.00 13.38 £ 

A1£M 245 HZ XYS A 1P1 64.750 -7.006 35.825 1.00 15.26 E 

ATOM 246 H LEV A 1P2 67.013 -5. 043 30.732 1.00 22.22 n 

AICM 242 CA LED A 102 67.003 -3.744 30.092 1 .00 IS. 40 £ 

ATOM 248 C XEV A 192 65.612 -3.115 30.156 1.00 18.55 £ 

40 AZSM 249 O XEV A IP? 64.590 -3.811 30.102 1.00 18.92 £ 

ATOM 25P Cg XEV A 1P2 67.465 -3.898 28.636 1.00 11.23 £ 

AI2M 251 Cg XEV A IP? 67.553 -2.711 27.651 1.00 15.51 £ 

Aiaa 252 CP1 XEV A 1P2 68.628 -2.985 26.559 1.00 9.65 £ 

ATPK 253 CP2 XEV ft 192 66.162 -2.407 26.995 1.00 13.10 £ 

4 5 A1SM 254 H XEV ft 1P3 65.595 -1.798 30.318 1.00 17.05 ft 
ATOM 955 CA LEU A 103 64.356 -1.036 30.265 1.00 1K.M C

ATOM 956 C LEU A 103 64.346 -0.07? 79.046 1.00 19.65 C

ATOM 957 O LEU A 103 65.215 0.789 28.875 1.00 19.68 O

ATOM 958 CB LEU A 103 64.099 -0.289 31.562 1.00 12.28 C

ATOM 959 CG LEU A 103 62.686 0.259 31.594 1.00 H,» C

ATOM 960 CD1 LEU A 103 61.645 -0.822 31.902 1.00 10.31 C

ATOM 961 CD2 LEU A 103 62.646 1.360 32.601 1.00 12.30 C

ATOM 962 N PHE A 104 63.417 -0.333 28.140 1.00 16.41 N

ATOM 963 CA PHE A 104 63.21? 0.486 26.956 1.00 10.32 C

ATOM 964 C PHE A 104 62.126 1.546 27.249 1.00 21.85 C

ATOM 965 O PHE A 104 61.168 1.271 27.992 1.00 18.36 O

ATOM 966 CB PHE A 104 62.796 -0.396 25.793 1.00 ?.96 C

ATOM 967 CG PHE A 104 62.732 0.348 24.508 1.00 16.81 C

ATOM 968 CD1 PHE A 104 63.894 0.714 23.840 1.00 25.04 C

ATOM 969 CD2 PHE A 104 61.511 0.795 24.005 1.00 22.59 C

ATOM 970 CE1 PHE A 104 63.836 1.448 22.619 1.00 31.26 C

ATOM 971 CE2 PHE A 104 61.449 1.535 22.814 1.00 15.59 C

ATOM 972 CZ PHE A 104 62.625 1.895 22.139 1.00 11.67 C

atom 973 w leu a los 62.341 2,762 26.734 l.PP 2P.33 U 

20 ATOM 974 CA LEU A 105 61 .416 3.897 26.904 1.00 18.10 £ 

atom 975 c My ft IPS 60-711 4,237 25.634 l.PP 17.04 £ 

atom 976 o lev ft IP? 61,315 4 , 680 24,665 1.00 18,93 Q 

atom 977 cb leu A 105 62.178 5.146 27.214 1,PP 17.49 £ 

ATOM 978 CG LEU A 105 62.434 5.544 28.644 1.00 27.17 . £ 

25 ATOM 979 CD1 LEU A 105 62.630 4,349 29,5741 1,PP 19,16 £ 

ATOM 980 CD2 LEU A 105 63,688 6,347 28.529 1,00 23,5? £ 

ATOM 981 W GLY A 106 59.407 4,153 25,652 l.PP 2P.66 H 

ATOM 982 CA GLY A 106 58.679 4.536 24.455 1,PP 21,03 £ 

ATOM 983 C GLY A 106 58.080 5.935 24.597 1.00 17.32 £ 

30 ATOM 984 O GLY A 106 58.690 6.858 25.113 1.00 26.89 £ 

ATOM 985 W SER A 107 56.831 6.047 24.219 1.00 22.05 fl 

atom 9B6 ca ser a 107 56,177 7,317 24.2PP 1,PP 22,12 £ 

&2QM 987 C SEP ft. 1P7 54,686 7,212 23,923 1-PP 19, P6 £ 

ATOM 988 O SER A 107 54 .314 6.545 22.963 1.00 27.42 £ 

35 ATOM 989 CB SER A 107 56.882 8.232 23.3PP l.PP 20.99 £ 

ATOM 990 OG SER A 107 55-947 9' 133 22,776 1 , OP 42.85 £ 

atom 991 n SER ft log 53,826 7,89P 24,671 l.PP 27,42 H 

atom 992 ca ser a 108 52.382 7.947 24.339 l.OP 26.43 £ 

ATOM 993 C SER A 108 52.144 8,259 22.842 1 rOP 30.97 £ 

40 atom 994 o ser a 108 51,242 7.709 22.217 l.PP 33,46 £ 

ATOM 995 CB SER A 108 51.710 9.072 25.144 1.00 19.87 £ 

ATOM 996 OG SER A 108 52.495 10.266 25.071 1.00 70.98 £ 

ATOM 997 W CYS A 109 52.927 9.180 22.278 1.00 24.73 H 

ATOM 998 CA CYS A 109 52.728 9,549 20.880 1.P0 25.61 £ 

45 ATOM 999 C CYS A 109 52.970 8,482 19,815 1.00 21,29 £ 
ATOM 1045 CD1 LEU A 114 40.991 8.797 13.504 1.00 19.29 C

ATOM 1046 CD2 LEU A 114 41.139 10.51? 15.300 1.00 ?6.85 C

ATOM 1047 W ATA A US 43.103 6,473 18,527 l.PO 29, PP H 

atom 1048 r-A at.a A 115 42,920 5,446 19,52? l.pp 25,66 C 

5 atom 1049 r. ata a 115 41,722 5.727 2P.151 l.PP 28.76 £ 

ATOM 1050 O ATA A 115 41.364 6,855 2P.682 l,pp 21.12 Q 

atom 1051 m MX) ns 44.177 5.272 2P.326 l.PP 16,86 C 

ATOM 105? W IYS A 116 41.137 4.675 20.998 1.00 30.21 H 

atom 1053 pa t.ys a 116 40.036 1,192 21.928 1,PP 25.85 £ 

10 atom 1054 p lys a 116 40,66g 5.218 23.195 l.PP H.18 £ 

atom 1055 O T.YS A 116 41.750 4.791 23.535 1.00 23.51 O 

atom 1056 r* t.ys a 116 39.369 3.415 22.116 l.PP 22,05 £ 

atom 1057 pc t.ys a lie 39.053 3.P32 23.524 l.PP 55.38 £ 

ATOM 1058 PP LYS A 116 37.963 1.955 23.549 1.00100.00 £ 

15 atom 1059 ck lys a 116 37,1 2 P 1.953 24.8?5 1.PP1PP.PP £ 

ATOM 1060 MS LYS A 116 35.767 1.31P 21.63P 1.PP1P0.PP H 

atom 1061 n glm a 117 10,021 6.2P8 23.856 l.PP 18.23 H 

atom 1062 pa glw a 117 IP. 156 6,757 25.18P l.PP 21. PI £ 

atom 1063 c CLN A 117 39,695 6.178 26.383 l.PP 3P.96 £ 

20 ATOM 1064 O GT.W A 117 38.483 6.PP9 26.345 l.PP 27,66 Q 

atom 1065 pr glw a 117 40.215 8.263 25.179 l.PP 11.32 £ 

ATOM 1066 CG SLW A 117 40.849 8.912 23.948 l.PP 12.12 £ 

atom 1067 pp glm a 117 42.1P4 8.623 23.954 l.PP 21. IP £ 

atom 1068 obi glm a 117 43.P11 8.628 22.896 1,PP 17.88 Q 

25 atom 1069 wf.2 glw a 117 13.P01 8.953 25.131 l.PP 11-24 _H 

atom 1070 w pro a n b 40.374 5.992 27.199 l.PP 3P.P2 H 

atom 1071 pa pro a 118 41,g?<? 6.191 27.655 l.PP 26-11 £ 

atom 1072 p pro a 1 18 42.450 5.P5P 26,899 l.PP 21.37 £ 

atom 1073 o pro a iib 41.792 1,P27 26.726 l.PP 25,31 Q 

30 ATOM 1074 CB PRO A 118 4 9 055 5.994 29.167 1.00 23. 69 £ 

atom 107S pg pro a 118 1P.847 5.24P 29.654 l.PP 23. 2P £ 

atom 1076 pd pro a lie 39 . 695 5.519 28.7P9 l.PP 15.79 £ 

ATOM 1077 M MET A 119 43.684 5.228 26.132 1,PP 16. PP H 

atom 1078 pa met a 119 11,37? 1.215 25.611 l.PP 1P.8P £ 

35 atom 1079 p met a 119 15,062 3.P83 26.411 l . PQ 23.61 £ 

atom iobo o met a 119 16,013 3.281 27.2P9 1,00 18. 0 2 B 

atom 1081 pb met a 119 15.384 1-891 21.791 l.PP 13.52 £ 

atom 10B2 pg met a 119 41,801 6.P11 23.989 l.PP 18,52 £ 

ATOM 1083 SP MET A 119 46.157 7.054 23.271 l.PP 26.27 5. 

40 atom 10B4 pe met a 1 19 46.264 6.524 21,845 l.PP 33,79 £ 

ATOM 1085 W ALA A 120 44.559 1.875 26.271 l.PP 26.61 H 

atom 1086 pa Ala a 120 45.177 P. 712 26,881 l.PP 29.17 £ 

ATOM 1087 P ALA A 120 46.356 P.3P8 25.981 l.PP 23.23 £ 

ATOM 1088 O ALA A 120 4 6 419 0.7S9 24.833 1.00 20.19 £ 

45 atom 1089 pb ala a 120 11,169 -P.119 26.911 l.PP 26.0 2 £ 
ATOM 1090 N GLU A 121 47.238 -0.553 26.507 1.00 12.30 N

ftXOl 1091 CA ffW ft 171 48.427 -1. 009 25.788 1.00 9.45 £ 

azai 1092 C SEP A 121 48.070 -1.697 24.4.sn 1.00 n.ca c 

1H22_2 gIV ft 121 48.B2B -1.670 23.450 1.00 14. 84 Q 

5 aiSM 1091 CB glV ft 1?1 49.321 -1.883 21,215 1.00 16.74 £ 

MSM 109? Cg gLV ft m 50.132 -1.122 27.763 1.00 18.14 £ 

1096 CP GUI ft 121 49.458 -I.O OO 29.137 1.00 13.00 £ 

1097 OBI gLV ft 121 48.252 -1.294 29.276 1.00 20.79 2 

ftlCM 1Q2& CE2 gLV ft 121 50.123 -0.521 30.080 1.00 17.86 Q 

10 aias — 1099 w — SEP a 122 46.887 -2. 273 24.409 1.00 11.79 H 

MSM HOP CA SER ft 1?? 46.427 -2.977 23.218 1 00 12 16 £ 

aifll 1123— C SER ft 122 46.030 -2.058 22.100 i.on 11.70 £ 

aiQl UP2 P SER A 122 45.717 -2. 529 21.010 1.00 13.91 Q 

&X£M UP3 CP SER ft 122 45.186 -3. 7B1 23.568 1.00 21.50 £ 

15 MEM 11PJ Qg SER ft 1?? 44.143 -2.908 23.976 1 .00 28.52 £ 

»P5 W g^V ft 12? 46.041 -0.754 22.341 1.00 14.65 H 

ftlSH 1106 CA gLV A 123 45.783 0. 202 21 .243 l.OO 17.15 £ 

ftISM 1107 C gLV A 123 46.959 0.313 20.240 1.00 11,48 £ 

ftlSM HOP 0 gLV A 123 16.921 0.844 19.14 1 1.00 11.19 q 

20 ft»3M HOP CB gLV ft 123 45.491 1.600 21.80 5 i.oo 21.66 £ 

ftiai U1P Cg gLV ft 123 41.127 1.694 22.523 1 , 00 94 .68 £ 

ftlGM 1111 CP gLV ft 12? 42,984 1.374 21.585 i.oo as .56 £ 

ftISM 1112 CE1 gLV ft 123 43.019 1.B65 20.426 1.00 41.73 £ 

Aiai 1113 0E2 gLV ft 123 42.158 0.497 21.940 1.00300.00 o 

25 ftlCM UK W LEV ft 121 48.134 -0.^5 20.618 1 .00 14 02 H 

ftlSH 1115 CA LEV ft 121 49.296 -0.082 19.740 1.00 15.32 £ 

ftISM 1116 C LEV ft 121 49.082 -0. 754 18.458 1.00 17.76 £ 

1112—2 LEV A 121 48.752 -1.917 18.445 1.00 18.91 £ 

ftISM HI? CP LBV ft 121 50.564 -O.6B0 20.362 1.00 18.07 £ 

30 ftlStl 1119 CP LEV ft 121 51.922 -0.2 22 19.803 1.00 21.52 £ 

ll?p CP1 LEV ft 121 52.080 1.258 20.117 1.00 20.35 £ 

&X2M 1121 CD2 LEU A 124 53.042 -0.919 20.550 1.00 14.07 £ 

ftlffl* H22 W LEV A 125 49.514 -0.071 17.409 1.00 18.44 H 

ftI2* 112? CA LEV A 125 49.445 -0.S64 16.052 1.00 19.92 £ 

35 aXOl "21 C LEV A 12? 48.034 -0.754 15.509 1.00 95.56 £ 

aiSM 1125 Q LEV A 125 47.854 -1.1 88 14.364 1.00 18.26 Q 

ftlGM 1126 CB LEV A 125 50.355 -l. BOO 15.840 1.00 20.79 £ 

ftI2M 1127 Cg LEV A 125 51.890 -i.su is.778 1.00 17.21 £ 

1129 CP1 LEV A 125 52.744 -2.649 16.316 1.00 19.95 £ 

40 1129 CP2 LEV A 125 52.334 -1.219 14.338 1.00 5.81 £ 

1130 H gLW A 126 47.027 -0. 327 16.276 1.00 21 .97 

ftXCM 1121 CA gJJLA 126 45.652 -0.504 15.790 1.00 19.97 £ 

ft3£M U3? C SLW A 126 15.213 0.447 14.724 i.oo 2an £ 

OEM U13__fi gLN A 126 11.076 0.391 14.293 1.00 47.49 q 

45 1121 CB PLN A 126 44.652 -0.404 16. 911 1.00 19. 87 £ 
ATOM 1135 CG GLN A 126 44.049 -1.31? 18.048 1.00 ifl.39 C

ATOM 1136 C.T> GT.M A 126 44.319 -2.626 17.835 l.OOfifi.BO £ 

ATOM 1137 OF.1 GLN A 126 44,064 -3.376 18,792 1 , Q0 4Q.75 Q 

ATOM 1138 NE2 GUI A 126 44-P15 -2.952 16,565 l.QQ 71,74 H 

5 ATOM 1139 M GT.Y A 127 46.080 1.33Q 14.270 1,00 28.29 H 

ATOM 1140 PA GT.Y A 127 45.627 2.260 13.252 1 , 00 21x21 £ 

ATOM 1141 C GLY A 127 jfiJjfiZ 3,315 12.953 l.QQ 22.90 £ 

atom 1142 o gly a 127 47.755 3.251 13.474 1.00 25.30 2 

ATOM 1143 M THR A 12B 46.311 4.219 12.046 1.00 19.51 H 

10 A12M 1144 Cft THR ft 129 47.149 5.314 11.588 l.PP 22.12 £ 

atom 1145 c thr a 128 47.705 6.219 12.695 1.00 22.60 £ 

ATOM 1146 O THR A 128 47.061 6.461 13.731 1.00 18.58 2 

ATOM 1147 CB THR A 128 46.392 6.182 10.544 1.00 35.98 £ 

atom 1148 OGi thr a 128 46.53? 5.594 9.239 1.00 58.05 £ 

15 ATOM 1149 CG2 THR A 128 11^12 7.639 10.542 1.00 43.41 £ 

ATOM 1150 W LEU A 129 48.907 6.715 12.425 1.00 18.32 H 

ATOM 1151 CA T.EU A 129 49.674 7.534 13.356 1.00 16.76 £ 

MSM 1152 C * LEV ft 129 49.504 8.959 12.967 1.00 4.89 £ 

AISM 1153 p leu a 129 49.232 9.260 11.814 IrPP 16.1,4 2 

20 ATOM 1154 CB LEU A 129 51.205 7.191 13.261 1.00 17.91 £ 

ATOM 1155 CG LEU A 129 51.769 5.804 13.752 1.00 18.21 £ 

ATOM 1156 CD1 LEO A 129 53.132 5.379 13.193 1.00 12,12 £ 

ATOM 1157 CD2 LETT A 129 51.683 5.532 15.251 1.00 3.89 £ 

atom iisb m SLg ft 130 49,816 9.827 13.917 l.PO 10,23 U 

25 ATOM 1159 CA GUI A 130 49.912 11.268 13.691 1.00 13.22 £ 

&I2M 1160 C BLH ft 130 51.128 11.544 12.775 1.00 23.44 £ 

atom 1161 o glu a 130 52.249 11.162 13.P9P l.PP 21.23 2 

ATOM 1162 CB GLU A 130 50.150 11.979 15.035 1.00 18.48 £ 

ATOM 1163 CG GLU A 130 50.754 13.376 14.886 1.00 77.44 £ 

30 ATOM 1164 CD GLD A 130 49.B33 14.328 14.121 1.00100.00 £ 

ATOM 1165 OE1 GLU A 130 48.588 14.205 14.340 1.00 36.19 2 

atom 1166 QE2 glu a 130 50.347 15.161 13.295 l.PP 21.03 2 

Mm 1167 W PRO ft 131 50.920 12.219 11.648 l.OP 21.35 H 

atom 1168 ca pro a 131 52.023 12.409 10.731 IrPP 14,78 £ 

35 ATOM 1169 C PRO A 131 53.201 13.132 11.265 1.00 14.98 £ 

ATOM 1170 O PRO A 131 54.325 12.847 10.853 1.00 20.99 £ 

atom 1171 cb pro a 131 51.413 13.154 9.552 l.OP 14.76 _£ 

atom 1172 cg pro a 131 50.071 13.485 9.949 1,00 20.99 £ 

ATOM 1173 CD PRO A 131 49.641 12.626 11 .047 1.00 17.25 £ 

40 ATOM 1174 N THR A 132 52 .986 14.095 12.159 1.00 18.77 U 

ATOM 1175 CA THR A 132 54.131 14.838 12.689 1.00 16.44 £ 

£J£H 1176 C THR h 132 , 55,102 13.951 13.408 l.PP 21.91 £ 

ATOM 1177 O THR A 132 56.317 14.088 13.234 1.00 24.17 £ 

ATOM 1178 CR THR A 132 53 71 6 15.907 13.606 1.00 23.45 £ 

45 ATOM 1179 £g] THR A 132 52.076 16.883 12.850 1.00 31.15 £ 
1.00 35.28
7.869
1.00 13.44
1.00 15.72
1.00 17.38
1.00 11.77
59.246
3.494
11.192
1.00 16.20
ATOM 1225 O ILE A 138 59.307 2.377 14.970 1.00 13.79 O

ATOM 1226 CB TT.P. A 138 58, Q64 4.172 12.316 1.00 17.85 £ 

ATOM 1227 CS1 HE A 138 56, gap 4.473 11.757 1.00 28.21 £ 

ATOM 1228 C62 TLB A 138 5g,674 2.986 11,602 1,00 9.81 £ 

5 ATOM 1229 CD1 TLE A 138 55.695 3,376 H.970 1,00 18.17 £ 

ATOM 1230 N AT.A A 139 60.243 j^£l 14.625 1,00 11.54 II 

ATOM 1231 CA AIA A 139 gJLjgJ 3,937 15,288 1.00 13.22 C 

AX2M 1232 c ftIA ft 139 61.Z5{ 3.364 16.675 1.00 18.73 £ 

ATOM 1233 o AIA A 1 39 61.791 2,318 17.031 1.00 20.44 2 

10 atom. 1234 ALA A 139 62,434 5,073 15.390 1.00 13.62 £ 

him 1235 N lys a 140 60.397 4.033 17.448 1,00 16.36 II 

ATOM 1236 CA LYS A 140 60.083 3.600 18.815 1.00 15.14 £ 

atom, 1237 c lys a 140 59,392 2,262 18,824 1,00 15.18 £ 

atom 1238 o lys a 140 59.824 1.346 19.475 1.00 21.42 i £ 

15 atom 1239 cb lys a 140 59.133 4.606 19.525 1.00 17.86 £ 

AI£2I 1240 cg lys a 140 59,925 5.806 20.152 1.00 21.11 £ 

atom 1241 cd lys a 140 61.208 5.478 20.958 1,00 16.75 £ 

atom 1242 ce lys a 140 61.664 6.735 21.835 1.00 10.06 £ 

atom 1243 wz lys a 140 62.688 6.496 22.921 1.00 14.40 II 

20 atom 1244 n u& a 141 58.356 2.116 18.027 1.00 11.49 H 

atom 1245 ca TLB a 141 57,703 0.828 17.977 1.00 17.92 £ 

him 1246 c UE A 141 58.72? -Q.28? 17.577 1,00 13.46 £ 

ATOM 1247 O TLB A 141 58.730 -1.374 18.148 1.00 13.92 Q 

ATOM 1248 CB TLB A 141 56.497 0,925 17.019 1,00 22.59 £ 

25 ATOM 1249 CC1 TLB A 141 55.466 1.906 17.557 1.00 17.61 C 

ATOM 1250 CS2 TLB A 141 55,863 -Q.411 16.700 1.00 10.49 £ 

atom 1251 cpi tle a 141 54.530 2.327 16.449 1.00 13.43 £ 

ATOM 1252 N ALA A 142 59.637 0,028 16.650 1.00 10.29 U 

ATOM 1253 CA ALA A 142 60.657 -0.931 16.228 1.00 7.15 £ 

30 ATOM 1254 C ALA A 142 61.456 -1.301 17.456 1.00 16.58 £ 

atom 1255 o ala a 142 61,839 -2,454 17,621 1,00 13,04 £ 

AIGM 1256 cb ala a 142 61,604 -0,288 15.130 1,00 4.44 £ 

atom 1257 n gly a 143 61.703 -Q.307 18.316 1.00 9.56 II 

ATOM 1258 CA GLY A 143 62.448 -0.525 19.527 1.00 5.15 £ 

35 ATOM 1259 C GLY A 143 61.770 -1.555 20.430 1.00 16.36 £ 

ATOM 1260 O GLY A 143 62.392 -2.482 20.967 1.00 14.11 £ 

him 1261 w ile A 144 60.476 -1,418 20.564 1.00 20.33 H 

ATOM 1262 CA TLB A 144 59.725 -2.314 21.407 1.00 15.35 £ 

atom 1263 c tle a 144 59.706 -3.732 20.859 1.00 19.84 £ 

40 him 1264 o tle a 144 59.836 -4.700 21.608 1.00 17.93 £ 

him 1265 CB TLB A 144 58,317 -1.819 21-55? l.PQ 1Q.60 £ 

him 1266 Cgl ILE a 144 58,311 -0,610 22.516 1.00 9.80 £ 

ATOM 1267 CG2 TLB A 144 57.410 -2.928 22.122 1.00 9.60 £ 

ATOM 1268 mi TLB A 144 57.022 Q.Q76 22.517 1.00 18.32 £ 

45 atom 1269 w t.ys a 145 59,520 -3,841 19.556 1.00 7.20 H 
17.759
1.00 21.84
1.00 17.94
C
ATOM 1285 CD2 LEU A 146
1.00 16.10
1.00 7.91
22.893
1.00 9.14
-7.378
1.00 22.7?
62.232
23.809
1.00 12.63
63.411
-3.623
24.316
1.00 15.02
-7.352
1.00 20. flj
ATOM 1293 CA GLU A 148 60.016 -9.555 22.567 1.00 16.09 C

ATOM 1294 C GLU A 148 60.685 -9.715 21.992 1.00 22^1 C

ATOM 1295 O GLU A 148 69.651 -10.888 22.226 1.00 12.05 O

ATOM 1296 CB GLU A 148 58.597 -8.268 22.946 1.00 14 T f5fi C

ATOM 1297 CG GLU A 148 57.864 -7.189 22.840 1.00 11.45 C

ATOM 1298 CD GLU A 148 56.471 -6.021 22.277 1.00 11.75 C

ATOM 1299 OE1 GLU A 148 56.117 -7.955 21.099 1.00 11.65 O

ATOM 1300 OE2 GLU A 148 55.728 -6.2?1 23.981 1.00 22. £f5 O

ATOM 1301 N SER A 149 61.368 -9.377 29.715 1.00 15.57 N

ATOM 1302 CA SER A 149 61.938 -10.428 19.897 1.00 lO.J] C

ATOM 1303 C SER A 149 63.040 -11.215 29.592 1.00 15.83 C

ATOM 1304 O SER A 149 63.102 -12.458 20.291 1.00 12.72 O

ATOM 1305 CB SER A 149 62.270 -9.936 18.488 1.00 9.4^ C

ATOM 1306 OG SER A 149 61.053 -9.659 17.792 1.00 15.91 O

ATOM 1307 N TYR A 150 63.910 -19.546 21.224 1.00 18.44 N

ATOM 1308 CA TYR A 150 65.965 -11.100 21.910 1.00 20.50 C

ATOM 1309 C TYR A 150 64.514 -11.918 23.158 1.00 21.87 C

ATOM 1310 O TYR A 150 64.939 -12.949 23.486 1.00 31.39 O

ATOM 1311 CB TYR A 150 66.005 -9.950 22.425 1.00 13.71 C

ATOM 1312 CG TYR A 150 66.994 -9.599 21.365 1.00 14.13 C

ATOM 1313 CD1 TYR A 150 J6.611 -9.673 29.317 1.00 14.64 C

ATOM 1314 CD2 TYR A 150 69.298 -10.000 21.369 1.00 18.3? C
ATOM 1315 CE1 TYR A 150 -8.390 19.219 1.00 11.91 C

Atom 1316 C.V.7 TYR A ISO 69 1 »H -0.682 20.345 1.00 11.10 £ 

flTOM 1317 r? TYR A 1*0 68 -fl.900 19.326 1.00 20. 9S £ 

ATOM 1319 OH TYR A ISO 69.739 -8-685 18.333 1.00 27.73 £ 

5 ftTOM 1319 M ASM A 151 63.536 11.249 23.8P1 l.pp H.B3 U 

R tam 1320 TA ASM A 1 -SI 62 -903 -11.889 24.937 1.00 23.62 £ 

ft tom 1321 r- asma isi 62.417-13.244 24.410 l.QP 28.53 _£ 

»tom 1322 n asm a 151 fi?,f3P -14.24P 25 . P72 l.PP 25,89 2 

»tom 1323 PR ASW A 151 61 .655 -11.113 25.439 1.00 20.95 £ 

10 atom 1324 re ASM A 1 51 61.989 -9.867 26.264 1,00 15.07 £ 

>tom 1325 nm asm a 151 61,126 -9,Q?Q 26.466 l.PP 26.72 £ 

*™m 1326 wn? asm a 151 63.231 -9.7P9 26.7PP l.PP 6.33 H 

««» 1327 m arg a 152 61,731 -13.249 23.25? l.pp 19.91 II 

?™m 1398 c a arg a 152 61.129 -14.465 22.687 l.PP 17.62 £ 

15 *TOM 1329 r arg a 152 62.09P -15.523 22 . 188 l.PP 21.34 £ 

atom 1330 o arg a 152 61.959 -16.6P7 72.542 l.PP 15.44 2 

VMM 1331 ™ ARG A 152 6P.PB6 -H . 148 21.61P l.PP 15. 3P £ 

atom 1332 rr. .arg a 152 58.672 -13.754 72.157 1,PP 17.22 £ 

vmm 1333 rn ARG A 152 57.65? -13.297 21.049 l.PP 9.11 £ 

20 VMM 1334 mr arg a 152 57.161 -14. 419 2P.241 l.PP 21. P5 H 

vmm 1335 r* arg a 152 57.159 -14.447 18.912 l.PP 28.61 £ 

vmm 1336 MHi arg a 152 s 7 , s9p -13.387 18. 2 21 l.PP 2 1.98 H 

VMM 1337 MH? ARG A 152 56.717 -IS. 538 18.262 1.00 26.11 N 

vmm 1338 m gt.m a 153 63.P9S -15.1P4 71.434 l.PP 16.54 H 

25 atom 1339 rm gt.m a 153 64.P44 -16.P36 2P.842 l.PP 9.74 £ 

ATOM 1340 C GLM A 153 65 -08? -16.443 21.807 1.00 16. 7P £ 

atom 1341 n gt.m a 153 65.529 -17.545 21.763 l.PP 24.35 £ 

ATOM 1342 r.R GLM A 1 5 3 64.789 -15.372 19.714 l.PP P. 99 £ 

ATOM 1343 r-G GLM A 153 65-935 -16.225 19.116 1.00 4.63 £ 

30 atom 1344 rn glm a 153 66.315 -15.637 17.762 l.PP 14.17 £ 

atom 1345 Q R1 gt.m a 153 65 . 611 -14. 763 17.254 l.PP 12.53 £ 

atom 1346 mr? glm a 153 67.466 -16.P24 17.228 l.PP 13.38 H 

atom 1347 m tyr a 154 65 . 566 -15.518 72 . 6P8 l.PP 14.35 M 

ATOM 1348 TA TYR A 154 66 E77 -15.839 23.483 1.00 12.16 £ 

35 atom 1349 c tyr a 154 66 . 373 -15.93P 24.954 l.PP 19. P6 £ 

atom 1350 a tyr a 154 67 , 185 -16.2P7 25 . 777 l.PP 25.59 2 

atom 1351 ™ tyr a 154 67 . 829 -14.816 73.326 l.PP 16.89 £ 

atom 135? c fi T ™ * 15 « 68.418 -14.733 71,943 1.PP 17.53 : £ 

vmm 1353 m i tyr a 154 fi9 , ?S9 -15.7 2 6 71,467 l.PP 18.91 £ 

40 atom 1354 rn? tyr a 154 68.nnn -13.712 21.091 1,00 13.97 . £ 

vmm 1355 r*i tyr a 154 69 . 762 -15 . 686 70 . 190 l . PP 1P.*8 £ 

vmm 1356 CE2 tyr a 154 66.621 -13.639 19 . PP6 l.PP 2 3.81 £ 

atom 1357 c* tyr a 154 6«».468 -14.634 19.38P l.PP 23.P8 £ 

atom 1358 oh tyr a 154 7P , PP2 -14.619 16.11P l.PP 23.87 £ 

45 atom 1359 » gt.y a 15S 65 08P . -15.686 75 . 313 l.PP 12. P8 H 
1.00 18.12

ATOM 1398 CB ARG A 159 60.991 31.034 1.00 21.68 C

ATOM 1399 CG ARG A 159 60.986 -6.029 31.722 1.00 16.41 C

ATOM 1400 CD ARG A 159 61.135 -5.052 33.233 1.00 18.10 C

ATOM 1401 NE ARG A 159 61.305 -7.102 33.772 1.00 19.25 N

ATOM 1402 CZ ARG A 159 61.164 -7.720 35.959 1.00 36.67 C

ATOM 1403 NH1 ARG A 159 6C9P6 -e,77e 35.9K 1.00 15.32 N

ATOM 1404 NH2 ARG A 159 61.30? 35.440 1.00 11.79 N
ATOM 1405 N SER A 160 52xl&2 -2.661 28.859 1.00 24.44 N

»Tm 1406 r » ser a 160 £3 . 312 =1*323 29.200 1,00 21.?? £ 

it™ 1407 fi SEE * * a2 « 2 -"- S77 28 - 950 1-00 25 ' 07 2 

*TnM 1408 p SEP A - 1 1?7 29 - 45 « 1-00 17-02 C 

5 ^ 1409 r » ser a 160 99.119 -1.717 26.797 l . PP 13.0? £ 

arftM 1410 Pf ser A 1 60 59-7B? -1.897 25.885 1.00 37.57 fi 

atom 1411 m val a 161 58,378 p.742 28.927 l.PP 21.01 M 

ATflM 1412 rt VAT, A 1 61 57.369 1 ,644 29.5P9 1 . P0 9.70 £ 

aTnM 1413 r vat, a 161 57,068 7-747 28.501 LOP 16.77 C 

10 »™m 1414 o vat, a 161 57.955 3.119 27.729 l.PP 16.33 Q 

atom 1415 ™ vat, a ifii 57.806 j> t 248 3P.862 l.PP I 7 . 94 £ 

at™ 1416 rr.1 WT. A lfil 57.873 1 .185 31.981 l.PP 16.16 £ 

it™ i4n c r.9 vat, a 161 7 , 997 30.750 l.PP 21,10 £ 

atom 1418 m met a 162 55.79* 3,117 28 . 11? 1.0P 22,46 H 

15 »t™ 1419 rA met a 162 55.296 LIB* 77.513 1.00 19. 2 3 £ 

vmm 1420 r met a ifi? S4.B80 5,312 28.397 1.00 25.19 £ 

»thm 1421 n mft a 162 53.76? 5 . 769 28.961 1.00 18.35 fi 

>thm 142? n> met a 162 53.979 3 . 796 26. ESQ l.QQ 1 5.55 £ 

ft TOM 1423 rn MKT A 162 54.013 2 , 630 25.919 l.PP 37.79 £ 

20 »™m 1424 sn met a 162 51.354, 3,100 21.235 l.PP 52.07 & 

atom 142S rv. met a 162 56.193 3.134 21.110 l.PP 36. 3P £ 

ATOM 1426 w PRO A 1 63 55.730 6 . 313 28.5 2 1 l.PP 18 . 13 H 

atom 1427 c * pro a 163 55.39P 7.177 29.337 l.PP 17 . 76 £ 

atom 1428 fi pro a 163 54.300 B.384 28.667 1.00 21.23 £ 

25 itom 1429 n pro a 163 54.208 8 , 448 77 . 433 1.00 15.20 fi 

ftTOM 1430 r« PRO A 163 56.727 B.196 29.423 1.00 11.43 £ 

»tom 1431 rr. pro a 163 57.352 7 874 78.031 1.00 13.99 £ 

ft TOM 1432 rn pro a l fi3 57.086 6 . 491 27.949 1.00 12 .2 4 £ 

ft-MM 1433 m thr a 164 53,478 2 , 060 79.478 1,00 13.95 M 

30 ftTOM 1434 rft TWR A 164 52.581 10.121 28.963 1.00 25.8 2 £ 

atom 1435 r. thr a 164 S3, 406 11,441 7P.781 1,99 19.67 £ 

ATOM 1436 o THR A 16 4 54.633 11.393 28.868 l.PP 1 3.97 2 

ftTOM 1437 r» thr a 164 SI , 373 IP. 393 79 . 9P3 l.PP 25.51 £ 

ATOM 1438 OKI THR A 164 5 0 -470 11.321 29.267 1.00 14.77 fi 

35 ftTOM 1439 r*i?9 THR A 1 6 4 51.818 10.886 31.798 1.PP 9.P6 £ 

ftTOM 1440 m asm a 165 sp.751 12,589 78.556 l.PP 14-9 5 H 

ftTOM 1441 rA ash a 165 S3 , 448 13.9P1 78 . 481 l.PP 7.83 £ 

ftTOM 1442 r ftSMA16S 54.167 14.064 29.824 l.PP 11. 21 £ 

ftTOM 1443 ft asm a 165 ,53.554 13. 92* MM94 l.PP 17,66 fi 

40 ftTOM 1444 a asm a 165 f>?,434 15.P61 78.116 l.PP 14,48 £ 

ftTOM 14 45 rr. asm a 165 ftl.497 14.941 77.767 1.P0 23.7P £ 

ftTOM 1446 fifil ASM A 1 6 5 51.939 14.800 26.129 l.PP 22,37 S 

ftTOM 1447 MT.? »SM A 1 65 50.173 14.925 27.539 1 .00 27.22 U 

atom 1448 v t.eu a 166 5ft , 4iB 14.49P 79 . 777 l.PP 8, 2 3 H 

45 atom 1449 c * t.eu a 166 56 . 187 14,6P4 30.994 l.PP 1 4.40 £ 
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ATOM 


1450 


£,. 


LEU A 166 


56,62? 


16.017 


31.129 


1.00 25.05 






ATCW 


1151 


o 


-LEV A 166 


56,624 


16.718 


39.125 


1.9P 25.9? 


0 




ATOM 


1152 


CB 


LEU A 166 


57.460 


13.71? 


30.870 


1.00 17.48 






ATOM 


1453 


_ CG 


LEU A 166 


57.423 


12.218 


39.652 


1.00 16.63 


c 


5 


ATPM 


1154 


CD1 LEO A 166 


SB. 837 


».63? 


31.999 


1.00 22.52 


c 




atck 


1455 


CD2 LEU * 166 


56.336 


11.53? 


31.511 


1.09 7.46 


c 




ATOM 


1156 


...H 


-IIR A 167 


57.146 


16.391 


32.390 


1.00 19.78 


N 




ATM 


1457 


CA 


TYR A 167 


57.678 


17.769 


32.511 


1.00 18.58 


c 




ATM 


115? 


-C... 


TYR A 167 


58,534 


17.763 


33.767 


1.00 15.53 


c 


10 


ATOM 


145? 


0 


TYR A 167 


58.171 


16.852 


34.575 


1.00 16.71 


0 




ATOM 


1469 


...CB 


TYR A 167 


56,50? 


18.778 


32.665 


1.00 18.33 


c 




ATOM . 


1461 


_E£L 


..HR A 167 


55.671 


18.561 


33.931 


1.00 14.23 


c 




ATOM 


1452 


CD1 TYR A 167 


54.M4. 


17,618 


33,977 


1.00 13.35 


c 




ATM 


14(5? 


-CP2 


TYR A 167 


55.984 


19.258 


35.106 


1.00 16,5? 


c 


15 


ATOM 


1464 


CEl TYR A 167 


53.88? 


17.446 


35.146 


1.00 21.17 


c 




ATOM 




CE2 TYR A 167 


55.302 


1?,PP4 


36.264 


1.00 6.26 


c 




ATOM 


1466 


.02 


TYR A 167 


54.228 


IB .203 


36.296 


1.00 23. 56 


c 




ATOM 


1467 


OH 


TYR A 167 


53.526 


18.078 


37.504 


1.00 22.81 


p 




ATOM 


146? 


N 


SLY A 168 


59.334 


18,797 


33.952 


1.00 16^9 


N 


20 


ATOM 


146? 


OA 


GLY A 168 


6P.1S8 


18.817 


35.152 


1.00 ib^i 


c 




ATOM 


1470 


c 


GLY A 16B 


. , 61,534 


19.42B 


34.880 


1.00 13.69 


c 




ATOM 


1471 


0 


GLY A 168 


61.746 


20.028 


33.B37 


1.00 16.52 


0 




ATOM 


1172 


. N 


PRO A 169 


62.473 


19.263 


35.817 


1.00 20 rT 73 


N 




ATOM 


1473 


CA 


PRO A 169 


63.801 


19.822 


35,656 


1.00 16.07 


c 


25 


ATOM 


1474 


.0 


PRO A 169 


64.430 


19.353 


34.387 


1.00 27. IB 


c 




ATOM 1475 O PRO A 169 64.305 18.186 33.981 1.00 21.23 O




ATOM 1476 CB PRO A 169 64.595 19.296 36.805 1.00 17.28 C




ATOM 1477 CG PRO A 169 63.649 18.919 37.830 1.00 19.89 C




ATOM 


1470 


_£?._ 


PRO A 169 


62,263 


18.772 


37.J8? 


1.00 22.47 


c 




ATOM 


1479 


H 


HIS A 170 


65,22? 


29.235 


33.829 


1.00 19.48 


N 




ATOM 


1490 


..CA 


HIS A 170 


65.95? 


19,877 


?2,63f> 


1.00 25.56 


c 




ATOM 1481 C HIS A 170 65.096 19.707 31.428 1.00 29.15 C




ATOM 


1482 


_2_ 


HIS A 170 


65.553 


19.091 


3C.479 


1.00 29.71 


0 




ATOM 1483 CB HIS A 170 66.783 18.699 32.815 1.00 28.94 C




ATOM 1484 CG HIS A 170 67.703 18.671 34.034 1.00 33.88 C






ATOM 


1485 


NDl HIS A 170 


6B.975 


19.203 


33.96? 


1.00 25.46 


N 




ATOM 1486 CD2 HIS A 170 67.518 18.298 35.326 1.00 34.77 C




ATOM 


1487 


CEl HIS A 170 


69.531 


19.151 


35.166 


1.00 25.63 






ATOM 


1488 


NE2 


HIS A 170 


68.673 


18.6Q3 


36.008 


-1,90 3* ,72 


N 




ATOM 1489 N ASP A 171 63.881 20.245 31.440 1.00 21.52 N




ATOM 


1490 


CA 


ASP A 171 


63.041 


20.267 


39.21? 


1.00 28.63 


C 




ATOM 


1491 


C 


ASP A 171 


63.630 


21.45? 


29.35? 


1.00 41.94 


C 




ATOM 


1492 


O 


ASP A 171 


64.534 


22.171 


2?. 835 


1.00 29.69 


0 




ATOM 1493 CB ASP A 171 61.552 20.558 30.602 1.00 26.40 C




ATOM 


14?4 




ASP A 171 


69.552 


2P,P?7 


2?. 549 


1.00 22.32 


c 
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*THM 1495 QUI ASP A HI fin .ago 20.067 28.325 1.00 32.03 2 

atom 1496 on? asp a 171 59,127 19,719 29,916 1,00 42,13 £2 

atom 1497 m asw a 172 620il 21.712 29.1?7 1,90 H 

itom 1498 r-A asw a 172 fa.fiit 22,693 ;>7.308 i.po 35,95 £ 

5 »™" 1499 r ASW A 172 67.665 24.P56 77 . g74 1.00 33.71 £ 

ATOM 1500 O ASW A 172 61-586 24.102 27.104 1.00 32.69 2 

atom 1501 cb asw a 172 63,632 22.667 25.869 1.00 41.60 £ 

atom 1502 cs asw a 172 63.807 23.997 2?. 086 1.00 39.09 C 

"rnw 1503 QD1 asw a 172 6?. 973 24.347 24.259 1.00 83.94 fi 

10 *™m 1504 wn? asw a 172 64. 855 24.740 25.418 1.00 65.07 H 

ATOM 1505 N PHE A 173 63.021 24.953 28.583 1.00 31.93 N

ATOM 1506 CA PHE A 173 62.082 26.030 28.944 1.00 46.24 C

ATOM 1507 C PHE A 173 61.989 27.260 28.045 1.00 69.01 C

ATOM 1508 O PHE A 173 62.278 28.395 28.465 1.00 58.79 O

ATOM 1509 CB PHE A 173 62.225 26.459 30.390 1.00 43.43 C

ATOM 1510 CG PHE A 173 61.867 25.399 31.356 1.00 34.19 C

atom 1511 cm phe a 173 62,810 24.488 31.751 1.00 24. 6P £ 

ATOM 1512 CD2 PHE A 173 60.621 25.354 31.925 1.00 24.94 C

ATOM 1513 CE1 PHE A 173 62.524 23.548 32.682 1.00 23.64 C

ATOM 1514 CE2 PHE A 173 60.305 24.366 32.804 1.00 31.32 C

ATOM 1515 CZ PHE A 173 61.263 23.457 33.192 1.00 24.30 C

atom 1516 w his a 174 61.510 27.036 26.931 1.00 68.16 H 

atom 1517 ta hts a 174 61.401 28.109 25.871 1.00 64,53 £ 

atom 1518 e his a 174 59.97? 29.221 25,400 1.00 71.58 £ 

25 ATOM 1519 O HTS A 174 59.309 27.186 25.249 1.00 73.20 fi 

atom 1520 cb his a 174 62.419 27,870 24.736 1.00 71.71 £ 

ATOM 1521 re HIS A 174 63.835 27.868 25.229 1.00 92.29 £ 

atom 1522 wm his a 174 64.921 27.539 24.440 1.00100.00 H 

ATOM 1523 rn2 HIS A 174 64.338 28.133 26.463 1.00100.00 £ 

30 ATOM 1524 C.TA HIS A 174 66.032 27.628 25.160 1.00100.00 £ 

ATOM 1525 WF.2 HIS A 174 65.705 27.981 26.393 1.00100.00 H 

ATOM 1526 W PRO A 1 75 59.469 29.461 25.262 1. 00 65.71 U 

atom 1527 rA pro a 175 58.109 29-658 24.770 1,00 55,72 £ 

ATOM 1528 C PRO A 175 58.233 29.297 23.267 1.00 75,83 £ 

35 ATOM 1529 O PRO A 175 57.224 29.226 22.554 1.00 69.59 £ 

ATOM 1530 CB PRO A 175 57.866 31.142 25.026 1.00 49.14 £ 

atom 1S31 rr. pro a 175 59.258 31.790 24.901 1,00 42.23 £ 

atom 1532 rn pro a 175 60.286 30.695 25.10? 1.00 49.59 £ 

atom 1S33 m ser a 176 59 , 480 28.954 22.879 1,00 85 . 09 H 

40 atom 1534 ta ser a 176 59.954 2P.474 21.548 1.00 81.18 £ 

atom 1535 r ser a 176 59,660 26.965 21.343 1.00 73. 9P £ 

atom 1536 n ser a 176 59.617 26.45P 2P.213 1.00 57.03 £ 

atom 1537 ™ ser a 176 61.49? 28.666 21.447 1.00 71.32 £ 

atom 1538 nn ser a 176 62.04B 29.349 22.578 1,00 51.93 2 

45 atom 1539 w asw a 177 59,5?p 26,276 22.480 1,00 66,23 H 
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BESS! 154P CA ASN A 177 59.274 74«47 22.619 1.00 56.41 £ 

ECO! 1541 C ASN A 177 57.810 24.497 22.353 1.00 60.91 £ 

aXSM 1542 P ASN A 177 56.914 25.215 22.811 i.OQ 55.58 Q 

AX2M 1543 CB ASN A 177 59.619 24.469 24.065 1.00 50.45 £ 

5 AIQM 1544 Cg ASN A 177 59.562 22.970 24.319 1.00 66.57 £ 

OEM 1545 PP1 ASN A 177 59.095 22.216 23.476 1.00100.00 £ 

AICM 1546 NP2 ASW A 177 60.099 22.546 25.464 1.00 35.61 ft 

AISM 1547 V SER A 12S 57.583 23.387 21.627 1.00 57.10 M 

AXCM 1548 CA SER A 178 56.234 22.853 21.279 1.00 50.50 £ 

10 AICM 154? C SER A 178 55.557 22.159 22.491 1.00 76 74 £ 

ABM 155g 0 SER A 178 54.575 21.400 22.304 1.00 99.63 Q 

AISM 1551 CB 5ER A 178 56.316 21.800 20. na 1.00 10. n £ 

AXCM 1552 Pg SER A 17ft 57.397 22.112 19.217 l.OO 71.69 £ 

Aiai 1553 N HIS A 179 56.134 22.284 23.694 1.00 37.39 M 

15 AISM 1554 CA. HIS A 179 55.569 21.587 24.855 1.00 30 r Q6 £ 

1555 C HIS A 179 54.961 22.616 25.767 1.00 21.93 £ 

1556 P HIS A 179 55.641 23.598 26.138 1.00 25.17 £ 

aiSM 1557 CB HIS A 179 56.634 20.683 25.575 1.00 36.20 £ 

1558 Cg HIS A 179 56.973 19.419 24.935 1.00 42.90 £ 

20 1559 HPl HIS A 179 56.973 19.3 35 23.457 1.00 49.52 M 

AlflM 15 SO CP2 HIS A 179 57.323 1B. 190 25.278 1.00 52.42 £ 

AICM 1561 CE1 HIS A 179 57.283 18.109 23.084 1.00 44.78 £ 

AlflM 1562 NS2 HIS A 179 57.500 17.393 24.168 1.00 50.49 H 

1563 N VAL A 18P 53.661 22.454 26.038 1.00 19.14 ft 

25 aiCH 1564 CA VAL A 18P 52.886 23.449 26.789 1.00 29.03 £ 

SKM 1565 C VAL A 18P 53.373 23.890 28.142 1.00 31.29 £ 

AICM 1566 P VAIi A 180 53.348 25.075 28.447 1.00 19.55 £ 

aiCM 1567 CB VAL A 188 51.403 23.115 26.914 1.00 35.47 £ 

KESM 1568 Cgl VAI- A 1PP 50.630 24. 399 27.217 1.00 35.84 £ 

30 1569 Cg2 VAL A 18P 50.923 22. 550 25.663 1.00 36.11 £ 

aiCM 1570 N ILE A 1P1 53.684 22.935 29.005 1.00 26.57 

1571 CA ILE A 181 54.138 23.285 30.360 1.00 74.49 £ 

&X£ai 1572 C ILE A 181 55.371 24.213 30.361 1.00 16.51 £ 

1573 P ILE A 181 55.326 25.315 30.909 1 .00 74.47 £ 

35 &I£M 1574 CB ILE A 381 54.285 22. 018 31.264 1.00 20.20 £ 

1575 Cgl ILE A 181 52.878 21.428 31 .578 1.00 18.22 £ 

1576 Cg2 ILE ft 1P1 55.014 22.315 37.5R1 i nn n. xi £ 

AlSai 1577 CD1 ILE A 181 52.867 20.086 32.786 1.00 B.03 £ 

&XSM 157? N PPP A 182 56.452 23.779 29.718 1 .00 77.71 fl 

40 MSM 1579 CA PRP A 18?, S7.664 24.605 29.640 1.00 22.07 £ 

MflM 158P C PRO A 182 57.379 25.852 28.828 1.00 24.18 £ 

ai21 1581 P PRP A 182 57.811 26.949 29.210 1.00 18.35 £ 

aXSM 1582 CB PRP A 18? 58.682 23.72S 28.890 1.00 24.97 £ 

&XS1 1583 CG PRO A 182 57.925 22.473 28.471 1.00 75.77 £ 

45 &XPH 1584 CP PRP A 1P2 56.727 22.359 29.401 1.00 18.23 £ 
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ATOM 1585 W ATA A 1B3 Sfi.628 25.707 91.199 1.00 21.45 ft 

ATOM 1586 CA ATA A 1B3 56.261 26.896 76943 1.00 21.66 £ 

ATOM 1587 C ALA A 183 55.464 27.900 27.811 1.00 26.10 C

ATOM 1588 O ALA A 183 55.773 29.091 27.856 1.00 19.50 O

ATOM 1589 CB ALA A 183 55.473 26.513 25.703 1.00 13.26 C

ATOM 1590 N LEU A 184 54.472 27.389 28.543 1.00 23.34 N

ATOM 1591 CA LEU A 184 53.642 28.215 29.401 1.00 19.05 C

ATOM 1592 C LEU A 184 54.312 28.693 30.655 1.00 21.91 C

AXflM 1?93 P LEV A 184 29.771 31.158 1.00 19.71 Q 

ATOM 1594 CB LEU A 184 52.309 27.553 29.715 1.00 14.41 C

ATOM 1595 CG LEU A 184 51.342 27.595 28.525 1.00 23.42 C

ATOM 1596 CD1 LEU A 184 49.918 27.244 28.928 1.00 31.06 C

ATOM 1597 CD2 LEU A 184 51.380 28.896 27.690 1.00 21.73 C

ATOM 1598 N LEU A 185 55.178 27.879 31.213 1.00 18.39 N

ATOM 1599 CA LEU A 185 55.833 28.332 32.417 1.00 16.39 C

ATOM 1600 C LEU A 185 56.669 29.528 31.985 1.00 23.67 C

ATOM 1601 O LEU A 185 56.681 30.590 32.644 1.00 29.38 O

ATOM 1602 CB LEU A 185 56.723 27.233 33.015 1.00 15.05 C

ATOM 1603 CG LEU A 185 56.021 26.348 34.041 1.00 15.56 C

20 him ISM CP1 LEV A 185 se.ai9 25.022 34.301 i.oo 21.06 £ 

AI3f 1605 CP2 LEV A 185 55.722 27.113 35.321 1.00 11.02 £ 

Aiai Ififlfi » ARC a 186 57.337 29.397 30.852 1,00 17.09 II 

ATOM 1607 CA ARG A 186 58.137 30.523 30.429 1.00 18.82 C

ATOM 1608 C ARG A 186 57.308 31.752 30.069 1.00 29.00 C

ATOM 1609 O ARG A 186 57.629 32.880 30.476 1.00 23.91 O

ATOM 1610 CB ARG A 186 59.026 30.146 29.281 1.00 22.06 C

ATOM 1611 CG ARG A 186 59.653 31.365 28.652 1.00 38.46 C

ATOM 1612 CD ARG A 186 60.825 31.804 29.462 1.00 83.66 C

ATOM 1613 NE ARG A 186 62.012 31.861 28.631 1.00 70.77 N

ATOM 1614 CZ ARG A 186 63.058 32.622 28.904 1.00 91.68 C

ATOM 1615 NH1 ARG A 186 63.053 33.386 29.995 1.00 56.56 N

ATOM 1616 NH2 ARG A 186 64.098 32.639 28.082 1.00100.00 N

ATOM 1617 N ARG A 187 56.234 31.544 29.310 1.00 20.96 N

ATOM 1618 CA ARG A 187 55.361 32.662 28.941 1.00 19.32 C

ATOM 1619 C ARG A 187 54.765 33.453 30.142 1.00 28.41 C

ATOM 1620 O ARG A 187 54.823 34.700 30.193 1.00 17.23 O

ATOM 1621 CB ARG A 187 54.270 32.223 27.957 1.00 17.05 C

ATOM 1622 CG ARG A 187 54.813 31.546 26.720 1.00 61.42 C

ATOM 1623 CD ARG A 187 53.696 31.244 25.757 1.00 44.57 C

ATOM 1624 NE ARG A 187 53.033 32.472 25.354 1.00 29.47 N

ATOM 1625 CZ ARG A 187 51.831 32.534 24.790 1.00 17.82 C

him 1626 am ARg a m 5i.i3P 31.427 24.544 1.00 24.95 u 

ATOM 1627 NH2 ARG A 187 51.341 33.716 24.447 1.00 37.77 N

ATOM 1628 N PHE A 188 54.192 32.734 31.101 1.00 23.48 N

ATOM 1629 CA PHE A 188 53.604 33.399 32.259 1.00 21.24 C
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ATOM 


1630 


PHE A Iflfl 


54.631 


34.080 


33.095 


_J.OO 21.39 






ATCW 


mi 


^0 PHE A 1RR 


54.394 


.. 35,126 


33.626 


_JL.OO 23.90 


c 




ATOM 


1632. 


__CB PHE A Iflfl 


52.723 


32.466 


33.077 


1.00 19.95 


o 




ATOM 


1633 


CG PHE A m 


51,39? 


32.215 


32.435 


1.00 ??.?B 


c 


5 


ATOM 


1634 


_CB1 PHE A 1 BR 


SQ, HQ 


33.22? 


32.375 


_1.00 19.4? 


C 




ATOM 1635 CD2 PHE A 188 51.141 31.039 31.731 1.00 23.82 C




ATOM 


iwe 


— CE1 — PHE A IBB 


19.191 


33.026 


31.712 


1.00 24.77 


C 




ATOM 1637 CE2 PHE A 188 49.936 30.826 31.057 1.00 20.17 C




ATOM 


1638 


gs PHE a iee 


48.945 


31,615 


31.068 


1.00 23.14 


c 


1 A 
10 


Aim 


163? 


_M His A 18? 


.. 55.831 


33.513 


33.118 


1.00 24.15 


c 




ATOM 1640 CA HIS A 189 56.933 34.122 33.837 1.00 28.79 C




ATOM 


1641 


__C HIS A 189 


- 57.303 


35,506 


33.315 


1.00 2R.5R 


c 




ATOM 


1642. 


_Q HIS A 189 


57.480 


36.463 


34.083 


_L00 20.07 


c 




ATOM 


1643 


_CB^«IS A 189 


58,148. 


. 33,268 


33.641 


1.00 31.38 


o 


15 


ATOM 


1644 


_£G HIS A 189 


59.364 


33,841 


34.290 


1.00 29.98 


c 




ATOM 


1645 


ND1 HIS A 180 


59,548. 


.33.933 


35.659 


1.00 31,Q0 


c 




ATOM 


1646 


CD2 HIS A 189 


. 60,44? 


.34.161 


33.766 


1.00 ?1.79 


N 




ATOM 


1647 


_CE1 HIS A 1R9 


-. 60.722 


34.371 


35.945 


1.00 ?4.04 


c 




ATOM 


1649 


_ NE2 HIS A 1R9 


61,257 


31.P15 


34r821 


1.00 19.53 


c 


20 


ATOM 


164? 


_M_. GLU A 190 


.. 57.539 


35.563 


...32,006 


1.00 2R.43 


ft 




ATOM 


1650 


CA GLU A 190 


57.876 


36.816 


31.324 


1.00 27.7? 


If 




ATOM .. 


1651 


_C GLU A 190 


56.725 


37,929 


31.437 


1.00 32,56 


c 




ATOM 


1652 


0 GLU A 190 


56.949 


38.995 


31.717 


1.00 ?7.06 


c 




ATOM 


1653 


CB GLU A 190 


- 58.122 


36.529 


29,819 


_L00 28.55 


o 


25 


ATOM ,._ 


1654 


CG GLU A 190 


, 59.150 


35,461 


29.614 


_1.00 35.29 


c 




ATOM 1655 CD GLU A 190 60.553 35.911 29.892 1.00 99.81 C




ATOM 1656 OE1 GLU A 190 60.913 36.037 31.085 1.00 86.56 O




ATOM 1657 OE2 GLU A 190 61.293 36.167 28.910 1.00100.00 O




ATOM 


165? 


N ALA A 191 


55.493 


37.391 


.31,196 


Jl.QO 32.67 


p 


jU 


ATOM _ 


1659 


CA ALA A 191 


54.349 


38,286 


31.311 


J. 00 25.30 


N 




ATOM 


1660 


C ALA A 19J 


54.?R7 


38.795 


32,742 


1.00 36. ?0 


0 




ATOM 


1661 


-2 ALA A 191 


53.92P 


39,924 


33,014 . 


1.00 27.5? 






MOM 


1662 


CB ATA A lg] 


53.05? 


37,56? 


31.000 


1.00 16. 4R 


Q 




ATOM 


166? 


N THR A 19? 


54.549 


37,?27 


33,693 . 


1.00 29.39 


0 


35 


ATOM 


1664 


CA THR A 19? 


54.395 


38.396 


35.041 


1.00 19. OR 


N 




ATOM 


1665 


C THR A 19? 


55.420 


39.494 


35.29? 


1.00 44. 7R 


c 




ATOM 


1666 


0 THR A 19? 


55.094 


40.550 


35.839 


1.00 40. 5R 


c 




ATOM 


1667 


CB THR A 19? 


54.515 


37.235 


35.983 


1.00 18.99 


o 




ATOM 


1668 


fVZI TWP & 1 QO 


53. 410 


36.348 


35.755 


1.00 34.36 


0 


40 


ATOM 


1669 


CG2 THR A 19? 


54.461 


37.738 


37,425 


1.00 21.15 


0 




ATOM 


1670 


K ALA A 191 


56.617 


39.312 


34,757 


1.00 48.58 


0 




ATOM 


1671 


CA ALA A 191 


57.705 


40.286 


34.905 


1.00 50.59 


K 




AT«* 


1672 


C ALA A 193 


57.496 


41,613 


34.145 


1.00 54.42 


0 




ATOM 


1673 


O ALA A 193 


. 57.952 


42.698 


34.553 


1.00 48.28 


0 


45 


ATCW 


1674 


CB ALA A 193 


—59,047 


39.640 


34.496 


L.00 51.78 


Q 

0 
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ATOM 1675 W GLM A 194 SSR10 41.530 33 n?9 1.00 41.16 N. 

ATOM 1676 CA GLN A 194 56.586 42.722 32.24 2 1.00 3B.03 £ 

ATOM 1677 C GLM A 194 55.264 43.389 37.576 1.00 40.85 £ 

AX0B 1679 O glN ft 194 54.830 44.284 31.845 1 . 00 si .90 Q 

5 AXPtf 1679 CB gLW ft 194 S6.599 42.358 30.750 1.00 35.96 £ 

ATOM 1680 CC GLW A 194 57.910 41.692 30.290 1.00100.00 £ 

ATOM 1681 CP gLH A 194 57.715 40.661 29.158 1.00100.00 £ 

ATOM 1682 OE1 GLM A 194 56.619 40.546 28.579 1.00100.00 Q 

ATOM 1683 HE2 GLM A 194 58.782 39.904 28.848 1.00100.00 H 

ATOM 1684 N GLY A 195 54.583 42.949 33.630 1.00 32.29 N

ATOM 1685 CA GLY A 195 53.236 43.464 33.864 1.00 36.26 C

ATOM 1686 C GLY A 195 52.299 43.332 32.593 1.00 45.33 C

axai 1687 p g&Y ft 195 51.515 44.242 32.346 i.oo 45.ie s 

ATOM 1688 N GLY A 196 52.405 42.245 31.788 1.00 36.33 N

ATOM 1689 CA GLY A 196 51.515 41.965 30.608 1.00 19.06 C

ATOM 1690 C GLY A 196 50.037 41.958 31.117 1.00 22.49 C

ATOM 1691 O GLY A 196 49.724 41.479 32.223 1.00 33.09 O

ftXSM 1692 W .PRO ft 197 49.144 42.657 30,4,31 l.PP 29.22 H 

&ZSM 169? Cft PRQ A 197 47.790 42.732 30.953 1.00 25.29 £ 

20 AXOl 16?4 C PRO A 197 47.091 41.413 30.674 1.00 24.64 £ 

ATOM 1695 O PRO A 197 46.192 40.991 31.411 1.00 24.75 £ 

ftXSH 1696 CB PR? A 197 47.162 43.911 30.176 1.00 26.31 £ 

AlfiU 1697 CP PRO ft 197 48.188 44.407 29.252 1.00 26.56 £ 

AX£M 1698 gn PRO A 197 49.307 43.454 29.203 1.00 30.25 £ 

25 ftiai 1699 W tS£ ft 199 47.572 40.723 29.658 1.00 16.88 H 

ATOM 1700 CA ASP A 198 47.067 39.418 29.405 1.00 21.65 £ 

him 1701 c ftsg ft 198 4B f p46 38,522 28.677 l.PP 31,28 £ 

ATOM 1702 O ASP A 198 49.062 38.978 28.172 1.00 34.57 £ 

ATOM 1703 CB ASP A 198 45.739 39.507 28.669 1.00 32.80 £ 

30 ftlfiU 1704 CG ASP A 198 45.B6B 40.055 27.256 1.00 46.13 £ 

ATOM 1705 OD1 ASP A 198 46.982 40.230 26.725 1.00 57.45 £ 

AISH 1706 OD2 ASP A 19B 44.817 40.271 26.640 1.00 67.61 £ 

ATOM 1707 M VAL A 199 47.713 37.234 2B.614 1.00 38.67 ft 

ATOM 170B CA VAL A 199 4B.499 36.226 27.901 1.00 27.79 £ 

35 atom 1709 c val a 199 47.462 35.469 27.P65 1,PP 25. PP £ 

him 1710 O VAL A 199 46.460 35.023 27.598 1.00 24.22 £ 

him 1711 cb val a 199 49.163 35.229 28.905 LuBfl 24, 37 £ 

ATOM 1712 CGI VAL A 199 49.B74 34.047 28.160 1.00 20.28 £ 

atom 1713 cg2 val a 199 5P.121 35.942 29.935 1,00 22.25 £ 

40 ATOM 1714 N VAL A 200 47.661 35.3B6 25.757 1.00 23.72 

atom nis ca val a 200 46.7P1 34.694 24.9P3 l.PP 23.99 £ 

him 1716 c Vftl ft 2PP 47.167 33.286 24.499 l.PP 22.65 £ 

ATOM 1717 O VAL A 200 48.321 33.10B 24.188 1.00 29.77 £ 

ATOM 1718 CR VAL A 200 46.358 35.548 23.680 1.00 23.11 £ 

45 ATOM 1719 CGI VAL A 200 45.561 34.737 22.598 1.00 16.25 £ 
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ATOM 


1720 


CG2 VAL A 200 


15,652. 


36.823 


21.130. 


1.00 27.86 






ATOM 


1721 


_N 


VAL A 201 


46.296. 


32.279 


21.632 


1.00 27.39 


N 




ATOM 1722 CA VAL A 201 46.588 30.893 24.265 1.00 9.63 C




ATOM 


1729 


-C 


VAL A 201 


45.653, 


30.529 


23.165 


1.00 19.63 


g 


5 


ATCM 


1721 


o 


VAL A 201 


44.452 


30.755 


23.312 


1.00 17.61 






ATOM 1725 CB VAL A 201 46.306 29.952 25.126 1.00 19.95 C




ATOM 


1725 


CGI VAL A 201 


46,703 


28.51? 


25.054 


1.00 20.85 






ATOM 


1727 


CG2 VAL A 201 


17.086 


30.439 


26.661 


1.00 16.73 


c 




ATOM 1728 N TRP A 202 46.210 30.080 22.030 1.00 14.36 N


10 


ATOM 


172? 


_CA 


TRP A 202 


15.422 


29.693 


20.865 


1.00 18.97 


c 




ATOM 


173? 


..c 


TRP A 202 


44.195 


28.572 


21.313 


1.00 36,22 


c 




ATOM 


1731 


.J?. 


TRP A 202 


44.934 


27.694 


22.057 


1.00 31,46 


o 




ATOM 


1732 


... OB 


TRP A 202 


46.292 


29.05S 


19.823 


1.00 19.14 


c 




ATOM 


17?? 


_CS 


TRP A 202 


47.243 


29.894 


19.066 


1.00 33.65 


c 


15 


ATOM 


1734 


...CD1 


TRP A 20? 


48.391 


29.463 


18.429 


1.00 35.28 


c 




ATOM 


1735 


CD2 TRP A 20? 


47.126 


31.282 


18.772 


1.00 39.90 


n 




ATOM 


1736 


_HE1 


TRP A 202 


48.941 


30.481 


17.693 


1.00 37.86 


K 




ATOM 1737 CE2 TRP A 202 48.228 31.624 17.922 1.00 38.35 C




ATOM 


1738 


_CE3 


TRP A 202 


46.206 


32.281 


19.138 


1-00 39.39 


c 


20 


ATOM 


17?? 


CZ2 TRP A 202 


48.380 


32.884 


17.367 


1.00 36.15 


n 




ATCM 


174Q 


C23 TRP A 20? 


46.356 


33.542 


18.578 


1.00 39.60 






ATCW 


1741 


CH2 


TRP A 20? 


47.428 


33.828 


17.684 


1.00 40.99 


c 




ATOM „, 


1742 


_H _ 


GLY A ?03 


43.245 


28.564 


20.842 


1.00 25.59 


If 




ATOM 


1713 


_CA 


GLY A 203 


42.332 


27.483 


21.169 


1.00 13.09 


c 


25 


ATCM 


1744 


_£ 


. SLY A 203 


41.260 


27.813 


22.193 


1.00 21.12 


c 




ATOM 


1745 


_0 . 


GLY A 203 


41.340 


28.815 


22.886 


1.00 22.86 


o 




ATOM 1746 N SER A 204 40.270 26.919 22.262 1.00 16.88 N




ATOM 


1747 


-OA 


SER A 204 




26.979 


23.192 


1.00 18.36 


c 




ATOM 


1748 


...c 


SER A 204 


39.56J 


26,664 


24.65? 


1.00 22.07 


c 


30 


ATCM 


174? 


0 


SER A 204 


38,888 


27.096 


25.604 


1.00 34.39 


o 




ATOM 1750 CB SER A 204 38.053 25.998 22.740 1.00 9.99 C




ATOM 


1751 


OG 


SER A 204 


38,237 


24,695 


23.291 


1.00 16.37 


o 




ATOM 


17?2 


Jf 


GLY A 205 


40,562 


25,813 


24.854 


1.00 12.42 


N 




ATOM 


1753 


-£A 


GLY A 205 


40.963 


25.411 


26.208 


1.00 11.64 


c 


35 


ATOM 


1754 


C 


GLY A 205 


40,208 


24.178 


26.711 


1.00 19.49 


c 




ATOM 


1755 


0 


GLY A 205 


40.422 


23.723 


27.838 


1.00 13.59 






ATOM 


1756 


N 


THR A 206 


39.292 


23.683 


25.881 


1.00 15.38 


Q 
N 




ATOM 


1757 


. OA.. 


THR A 206 


38.132 


.22.591 


26.281 


1.00 10.80 


c 




ATOM 


1758 


.0 


THR A 206 


39,056 


21.221 


26,154 


1.00 26.39 


c 


40 


ATOM 


175? 


0 


THR A 206 


38.564 


20.267 


26,737 


1.00 23.28 






ATOM 


1759 


.CB 


THR A 206 


37,124 


22.562 


25.460 


1.00 12.86 


c 




ATOM 


, 1761 


OGl THR A 206 


37.438 


22.395 


24.082 


1.00 13.12 






ATOM 


1762 


CG2 THR A 206 


36.348 


23.840 


25.620 


1.00 10.62 


p 




ATOM 


176? 


H 


PRO A 207 


40.101 


21.083 


25.354 


1.00 21.10 


0 


45 


ATOM 


1764 


OA , 


PRO A 207 


40.658 


19.743 


25.175 


1.00 18.15 


N 
0 
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fl T n*t nes r pro a ?m jJLJlfi ™ 1.00 21.75 £ 

a T ™ nee o pro a 207 41 JS3 19,925 77.215 1.00 20.65 2 

KBM 1767 r * ppq a 207 19. SO? 74 . 013 1.00 17. S3 £ 

A TP* 1768 m ppn a 207 41.146 21.213 23.307 1,00 21.45 C 

5 athm 1769 rn pro a 207 40 . 698 22.067 74 . 431 1.00 23.41 £ 

btom 1770 w met a 2 flft 41.11? 17.676 76.674 1.00 15.60 H 

1771 CA MET A 208 17.167 77 . 77* 1.00 2 2 . 94 £ 

*™* 1772 r . mp.t a 208 U^fifl 16.4 2 7 77 . 579 1.00 21,90 £ 

ATOM 1773 O MET A 909 43. 94« 15.677 26.633 1.00 23.16 2 

10 ft TOM 1774 fi» MET A 908 4 11 .645 16.273 28.386 1.00 32.86 £ 

ATOM 1775 rr, met A 9QB 39.630 17 057 29.223 1.00 46.17 £ 

atom 1776 s n met a 208 3P 3JU IS. 990 ?? . P26 1.00 57.85 & 

itom 1777 c e met a 208 17 . ?^ 15.028 28.343 1.00 59,23 £ 

atom 1778 w arg a 209 14 , 0?? 16.681 78 . 456 1.00 17,7? H 

15 atom 1779 r.A arg a 209 45 . 318 16.042 28.324 1.00 19.88 £ 

vroM 1780 fi arc a 2 0Q 45.871 15.534 29.639 1.00 16.92 £ 

ATOM 1781 O ARG A 909 4 5 .41* 15.946 30.697 1.00 16,58 2 

ATOM 1782 r.R ARG A 909 46.140 16.963 27.658 1.00 21.07 £ 

ATOM 1783 rr, ARG A 2 09 45.9R0 17.478 26.275 1.00 22.57 £ 

20 atom 1784 cti arg A 2 Q9 45.833 16.357 25.282 1.00 28.26 £ 

atom 1785 he arg a 209 45 . 5P6 16.619 73.906 1.00 23.15 H 

atom 1786 r* arg a 209 44 , 4?" 16.742 23.267 1.00 34.52 £ 

atom 1787 khi arg a 209 41 . 336 16 .2 67 73.890 1.00 18.03 H 

atom 1788 arg a 209 44 , ^9 17.175 2 2 .012 1 . 00 29.78 H 

25 atom 1789 w GT.n a 210 46.878 14.675 29.547 1.00 20.87 H 

atom 1790 <-a GT.p a 210 47.530 14 . P79 30.720 1.00 17.37 £ 

atom 1791 fi ci-n a 210 19,031 14.490 30.851 1,00 20.96 £ 

atom 1792 o GT.o a 210 49,748 14.622 29.841 1.00 22,44 £ 

ATOM 1793 CP. CIO A 910 47 400 12.562 30.571 1.00 16.26 £ 

30 atom 1794 rr, glp a 210 47 . 807 U . 785 31.809 1.00 19.9 1 £ 

atom 1795 r n gt.p a 210 1U5J 10.304 31.531 1.00 27.81 £ 

atom 1796 QEi glu a 210 48.111 9,*19 30,34? 1-PP 17.29 2 

atom 1797 oe2 GiiU a 91Q 48.26B 9.54P 32.494 l.PP 21.63 2 

ATOM 1798 W PHE A 211 49 ,504 14.712 32.084 1.00 14.02 U 

35 atom 1799 r.A phe a 211 50 , PB7 15.159 32.353 1.00 17.48 £ 

atom 1800 c phe a 211 51 45P 14.414 33,531 1.00 33.62 £ 

atom 1801 n phe a 21 1 so.716 14.031 34.443 1 .00 27.96 2 

atom 1802 r» phe a 211 50 , 933 16,677 32.644 l.pp 17,78 £ 

atom 1803 rr. phe a 2 l i 50.303 17.490 31 .541 1.00 21.49 £ 

40 atom 1804 rm phe a 211 51 009 17.676 3SL22& 1,00 17.36 £ 

atom i»05 rn 2 phe a 211 ifl 233 17,844 31.618 1,00 15,09 £ 

atom 1806 e g phe a 211 50 333 18.334 ?9.237 1.00 1 6.37 £ 

atom 1807 r B ? phe a 211 4P . 7PP 18.491 30 . 533 1.00 9.63 £ 

atom 1808 rs phe a 211 49 . Q53 18.756 ?«> , 344 1,00 12.7? £ 

45 atom 1809 n t.eu a 212 57 761 14 . 161 13 . 495 1 . 00 2 3.76 H 
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ATOM 1810 CA LEO A 212 53.405 13.448 34.603 1 .00 21.24 £ 

ATOM 1811 C LEO A 21? 54.772 14.053 34.898 1.00 14.00 £ 

ATOM 1912 O LEO A 212 55,519 33,985 1,00 13,9? fl 

ATOM 1813 CB LEO A 212 53,548 U.S51 34,294 1.0Q 21,52 £ 

5 ATOM 1814 CG LEO A 212 54.033 11.039 35,406 l.PQ 21.0? £ 

ATOM 1815 CP1LEOA 212 52.866 10.634 36.280 1.00 20.84 £ 

ATOM 1816 CD2 LEO A 212 54.768 9.829 34.832 1.00 13.18 £ 

atom 1817 n his a 213 55.023 11.302 36.175 It 00 9 .60 H 

ATOM 1818 CA HIS A 213 56.290 14.864 36.555 1.00 13.66 C

ATOM 1819 C HIS A 213 57.380 13.828 36.293 1.00 20.37 C

ATOM 1820 O HIS A 213 57.238 12.614 36.542 1.00 16.08 O

ATOM 1821 CB HIS A 213 56.280 15.250 38.002 1.00 18.72 C

ATOM 1822 CG HIS A 213 57.491 16.017 38.108 1.00 21.22 C

aiSti 1823 NP1 BIS A 21? 58.703 15.4Q6 38.656 i.po 21,29 H 

ATOM 1824 CD2 HIS A 213 57.716 17.353 38.499 1.00 23.67 C

ATOM 1825 CE1 HIS A 213 59.615 16.331 38.917 1.00 19.13 C

ATOM 1826 NE2 HIS A 213 59.041 17.523 38.847 1.00 21.99 N

AXSM 1827 N VMr A 211 58.15? 14.295 35.698 1.00 21.07 H 

ATOM 1828 CA VAL A 214 59.532 13.383 35.361 1.00 19.23 C

ATOM 1829 C VAL A 214 60.067 12.523 36.551 1.00 27.20 C

ATOM 1830 O VAL A 214 60.604 11.444 36.359 1.00 22.23 O

ATOM 1831 CB VAL A 214 60.625 14.125 34.566 1.00 11.84 C

ATOM 1832 CG1 VAL A 214 61.390 15.199 35.485 1.00 8.52 C

ATOM 1833 CG2 VAL A 214 61.560 13.097 33.902 1.00 12.39 C

ATOM 1834 N ASP A 215 59.893 12.994 37.790 1.00 25.29 N

ATOM 1835 CA ASP A 215 60.406 12.228 38.936 1.00 18.19 C

him 1836 C ASP A 215 59.53P 11.P23 39.230 1.00 13.85 £ 

AXSM 1837 P ASP A 215 59.988 9,981 3?. 666 1.00 17.44 £ 

ATOM 1838 CB ASP A 215 60.575 13.129 40.155 1.00 16.27 £ 

30 ATOM 1839 CG ASP A 215 61.859 13.979 40.068 1.00 30.73 £ 

ATOM 1840 QUI ASP A 215 62.782 13.614 39.308 1.00 23.02 Q 

ATOM 1841 OP2 ASP A 215 61.957 15.029 40.730 1.00 26.00 Q 

ATOM 1842 H ASP A 216 58.276 11.136 38.863 1.00 20.08 K 

ATOM 1843 CA ASP A 216 57.378 10.017 39.016 1.00 18.78 £ 

35 atom 1844 c ASP A 216 57,761 9.083 37,894 l.PP 33.56 £ 

atom 1845 o ASP A 216 57.715 7,880 38.026 1.00 20.79 2 

atom 1846 cb asp a 216 55.912 10,457 36.821 l.PP 17,18 £ 

ATOM 1847 CG ASP A 216 55.193 10.757 40.162 1.00 38.03 £ 

ATOM 1848 OD1 ASP A 216 55.503 10.119 41.223 1.00 26.02 £ 

40 um 1849 OD2 asp a 216 54,249 11.587 40,124 l.PP 25,41 2 

atom 1850 w MET A 217 58.P92 9.653 36,755 l.PP 18.11 H 

atom 1851 ca met a 217 58.394 8.785 35.636 1.00 22.41 £ 

ATOM 1852 C MET A 217 59.572 7.942 35.992 1.00 27.54 £ 

ATOM 1853 O MET A 217 59 .579 6.752 35.710 1.00 20.86 £ 

45 ATOM 1854 CB MET A 217 58.637 9.592 34.345 1.00 21.24 £ 
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ATOM 1855 Cfi MF.T A 217 59.478 8.91B 33.287 1.00 16.37 £ 

ATOM 1B56 sn MF.T A 217 58.962 7.412 32.473 1.00 30.51 £ 

ATOM 1857 CE MET A 217 57.465 7.608 32.391 1.00 10.57 £ 

ATOM 1858 W AT.A A 218 60-561 8.562 36.623 1.00 19.09 ft 

5 ATOM 1BS9 CA ATA A 218 §1,771 7-841 37.002 1.00 13.65 £ 

ATOM I860 C ATA A 218 61.436 6.778 38.028 1.00 22.61 £ 

ATOM 1861 O ATA A 218 61,934, 5.670 37.967 1.00 19.36 Q 

ATOM 1862 CB AIA A 218 62.899 8.780 37.579 1.00 12.23 £ 

ATOM 1863 W AjA, ft 219 69.69? 7.109 39.000 l.OO 19.34 H 

10 ATOM 1864 CA AIA A 219 6P.319 6.105 40.023 1.00 Ifl.Ol £ 

ATOM 18 65 C AjA ft 219 59.639 4.901 39.413 1.00 23.57 £ 

ATOM 1866 O ftLA ft 219 59.781 3.777 39.B98 1.00 22.71 £ 

ATOM 1867 CB AIA A 219 59.387 6.678 41.083 1.00 10.11 £ 

ATOM 1868 W &LA, ft ZZS 58.753 5.174 38.454 1.00 18.99 H 

15 ftiai 1B69 CA AIA A 220 57.995 4.158 37.855 1.00 14.12 £ 

ATOM 1870 C ftLA, ft 229 59.753 3.213 37.034 1.00 25.33 £ 

ATPM 1871 0 flJUft A 220 58.581 2.006 37.114 l.oo 20.63 Q 

ATOM 1872 CB AIA A 220 56.796 4.798 37.023 1.00 8.53 £ 

ATOM 1873 W SER ft 221 59.779 3.772 36.379 1.00 23.92 K 

20 ATOM 1874 CA SER A 221 69.792 3.011 35.556 1.00 18.38 £ 

ATOM 1875 C SER A 221 61.537 1.989 36.353 1.00 20.90 £ 

AICM 1876 P SE&JL22A 61.683 P. 799 35.983 1,90 19.81 Q 

&XQH 1877 CB SER A 221 61.691 3.985 31.891 1.99 19.67 £ 

atom 1878 os SER a 221 69.817 1.711 33.867 1,98 15.61 £ 

25 ATOM 1879 W ILE A 222 62.083 2.476 37.463 1.00 18.12 H 

ATOM 1880 CA TLB A 222 62.866 1,611 38.381 1,99 21,56 £ 

ATOM 1881 C ILE A 222 62.020 P. 551 39.968 l.PP 29.19 £ 

ATOM 1882 O ILE A 222 62.504 -0.566 39.307 1.00 19.03 £ 

ATOM 1883 CB ILE A 222 63 . 467 2,516 39.132 1.9P 21,56 £ 

30 Aieu 1881 CS1 ILE A. 222 61.165 3.173 38.765 l.PP 32.13 £ 

Kim 1885 CC2 ile a 222 61,129 1.671 1P.5PP l.PP 28.26 £ 

A12M 1886 CP1 ILE ft 212 61.973 1.585 39.619 1,PP 15.61 £ 

ATOM 1887 M HIS A 223 6P.772 P.9P7 39.381 l.PP 19.31 H 

ATOM 1888 CA HIS A 223 59.829 -0.031 39.996 1.0P 20.16 £ 

35 ATOM 1889 C HIS A 223 59.599 -1.097 38.964 1.00 24.82 £ 

AiaM 1890 O His ft 22? 59,723 -2.283 39.27P 1,9P 21,66 £ 

atom 1891 cb his a 223 58.465 P. 637 1P.359 l.PP 19.53 £ 

ATOM 1892 CS HTS A 223 57.373 -0.333 40.759 1.00 28.64 £ 

1B93 wni hts a 223 57,921 -P. 561 12.P82 l.PP 21.16 H 

40 ATOM 1B94 CD 2 HTS A 223 S6.497 -1.062 40.004 1.00 30.39 £ 

&X2M 1995 CE1 HIS ft 223 55.983 -1.399 12.11? l.PP 3P.39 £ 

AIQM 1896 ME2 HIS A 22? 55.652 -1,727 IP. 869 l.PP 28.13 U 

ATOM 1897 N VAL A 224 59.354 -0.684 37.725 1.00 22.06 K 

ATOM 1B98 CA VAL A 224 59.111 -1.657 36.652 1.00 19.15 £ 

45 ATOM 1899 C VAL A 224 60.350 -2.490 36.333 1.00 25.89 £ 
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atom 1900 n mn» 60.28? -V109 36.2SO 1.0(l fl 

1901 CB VA1A »4 58.559 -1.022 35.377 1.00 22.59 £ 

ATOM 1902 CGI VM, A 274 58.512 -2.050 34.231 1.00 22.61 £ 

ATOM 1903 CS2 VM, A 224 57.161 -0.491 35.650 1.00 23.44 £ 

5 VTTM 1904 W MET A 225 61.499 -1 .838 36.255 1.00 27.83 H 

ATOM 1905 CA MET A 225 62.710 -2.577 36.004 1.00 23.69 £ 

ATOM 1906 C MET A 225 62.896 -3.678 37.071 1.00 31.95 £ 

ATOM 1907 Q MET A 225 63.290 -4.805 36.785 1.00 24.33 £ 

ATOM 1908 CB MET A 225 63.902 -1.604 36.056 1.00 21.34 £ 

10 MSM 1909 CS MET A 225 65.295 -2.296 35.999 1.00 17.83 £ 

ATOM 1910 SD MET A 225 6S.7S0 -2.958 34.306 1.00 23.33 £ 

ATOM 1911 Cg MET A 225 67.080 -1.896 33.785 1.00 16.46 £ 

Um 1912 M CLP A 226 62.644 -3.319 38.316 1.00 19.54 H 

him 1913 CA GLP A 226 62.988 -4.161 39.428 1.00 21.58 £ 

15 AX2M laii C eat A 226 61 .999 -5.200 39.918 1.00 30.77 £ 

ATOM 1915 O GLP A 226 62.308 -6.012 40.780 1.00 29.39 fl 

ATOM 1916 CB GLP A. 226 63.613 -3.323 40.547 1.00 20.47 £ 

him 1?17 gg glff ft 226 64.937 -2.673 40.122 1.00 23.03 £ 

ATOM 1918 CD GLP A 226 65.504 -1.809 41.208 1.00 32.62 £ 

20 fiifli 1213 obi gjffl ft 226 64.721 -1.4SS 42.122 1.00 26.12 £ 

hX3tl 1920 PE2 gJiB ft 226 66.711 -1.479 41.152 1.00 17.67 £ 

ATOM 1921 W LED A 227 60.837 -5.248 39.295 1.00 34.11 H 

ATOM 1922 CA LEU A 227 59.883 -6.296 39.642 1.00 35.26 £ 

ATOM 1923 C LEU A 227 60.537 -7.644 39.320 1.00 27.91 £ 

25 ATOM 1924 O LEP A 227 61.291 -7.766 38.340 1.00 19.89 £ 

ATOM 1925 CB LEU A 227 58.693 -6.236 38.678 1.00 36.48 £ 

ATOM 1926 CG LEU A 227 S7.3B1 -5.569 38.955 1.00 40.30 £ 

ATOM 1927 CD1 LEU A 227 57.697 -4.194 39.382 1.00 42.04 £ 

ATOM 1928 CD2 LEP A 227 56.610 -5.577 37.647 1.00 46.21 £ 

30 atom 1929 m ala a 22B 69.026 -8r68B 39. 955 1,00 27.15 H 

ATOM 1930 CA ALA A 228 60.425 -10.051 39.616 1.00 25.26 £ 

atom 1931 c ala ft 228 59.601 -10.135 38.279 1.00 27.93 £ 

atom 1932 o ala a 228 56,621 -10.093 37.931 1,00 31.26 2 

ATOM 1933 CB ALA a 228 60.003 -11,052 10.703 1.00 22.05 £ 

35 ATOM 1934 W UTS A 229 60.624 -11.160 37.539 1.00 27.05 H 

ATOM 1935 CA HTS A 229 60.275 -11.605 36.222 1.00 24.42 £ 

atom 1936 c hts a 229 58.905 -12.260 36.161 1,00 21,71 £ 

ATOM 1937 O HTS A 229 58.015 -11.851 35.398 1.00 22.22 O 

ATOM 1938 CB HTS A 229 61.351 -12.520 35.698 1.00 17.71 £ 

40 ATOM 1939 rr. HTS A 229 61.284 -12.701 34.220 1.00 27.24 £ 

ATOM 1940 wnl HTS A 229 61.060 -11.650 33.350 1.00 34.38 H 

ATOM 1941 CD 2 HTS A 229 61.292 -13.821 33.465 1.00 31.45 £ 

ATOM 1942 CRT HTS A 229 60.992 -12,113 32.115 1.00 30.50 £ 

ATOM 1943 WB2 HTS A 229 61.121 -13,127 32.159 1,00 35,2? H 

45 ATOM 1944 W GLP A 230 58.681 -13.161 37.140 1.00 20.24 H 
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&XPH 1945 PA GT.D A 230 ST . 425 -13.895 37 209 1.00 29.41 £ 

ATOM 1946 C GLU A 230 56.181 -13.051 37.341 1.00 22.20 C
ATOM 1947 O GLU A 230 55.159 -13.359 36.679 1.00 17.78 O
ATOM 1948 CB GLU A 230 57.464 -14.997 38.274 1.00 38.51 C
ATOM 1949 CG GLU A 230 58.085 -14.582 39.567 1.00 63.09 C
ATOM 1950 CD GLU A 230 57.036 -14.473 40.661 1.00 100.00 C
ATOM 1951 OE1 GLU A 230 55.859 -14.872 40.4?? 1.00 100.00 O
ATOM 1952 OE2 GLU A 230 57.409 -14.003 41.768 1.00 ??.48 O
ATOM 1953 N VAL A 231 56.272 -12.004 38.182 1.00 16.53 N
ATOM 1954 CA VAL A 231 55.202 -11.029 38.356 1.00 20.23 C
ATOM 1955 C VAL A 231 55.009 -10.164 37.102 1.00 24.45 C
ATOM 1956 O VAL A 231 53.864 -9.834 36.705 1.00 21.00 O
ATOM 1957 CB VAL A 231 55.541 -10.057 39.426 1.00 28.61 C
ATOM 1958 CG1 VAL A 231 54.362 -9.098 39.610 1.00 29.78 C
ATOM 1959 CG2 VAL A 231 55.881 -10.757 40.677 1.00 28.96 C
ATOM 1960 N TRP A 232 56.133 -9.798 36.486 1.00 17.17 N
ATOM 1961 CA TRP A 232 56.052 -9.044 35.262 1.00 21.52 C
ATOM 1962 C TRP A 232 55.388 -9.844 34.156 1.00 20.53 C
ATOM 1963 O TRP A 232 54.588 -9.306 33.380 1.00 24.31 O
ATOM 1964 CB TRP A 232 57.438 -8.644 34.801 1.00 29.88 C
ATOM 1965 CG TRP A 232 57.430 -7.843 33.500 1.00 27.65 C
ATOM 1966 CD1 TRP A 232 57.184 -6.464 33.356 1.00 25.42 C
ATOM 1967 CD2 TRP A 232 57.714 -8.336 32.169 1.00 27.75 C
ATOM 1968 NE1 TRP A 232 57.325 -6.095 32.033 1.00 22.53 N
ATOM 1969 CE2 TRP A 232 57.655 -7.203 31.279 1.00 25.11 C
ATOM 1970 CE3 TRP A 232 58.037 -9.603 31.640 1.00 22.72 C
ATOM 1971 CZ2 TRP A 232 57.917 -7.316 29.979 1.00 17.23 C
ATOM 1972 CZ3 TRP A 232 58.238 -9.720 30.223 1.00 25.97 C
ATOM 1973 CH2 TRP A 232 58.154 -?.5?? 29.368 1.00 22.07 C
ATOM 1974 N LEU A 233 55.749 -11.121 34.018 1.00 23.80 N
ATOM 1975 CA LEU A 233 55.141 -11.949 32.937 1.00 24.78 C
ATOM 1976 C LEU A 233 53.652 -12.??? 33.122 1.00 24.51 C
ATOM 1977 O LEU A 233 52.865 -12.075 32.163 1.00 28.50 O
ATOM 1978 CB LEU A 233 55.765 -13.348 32.820 1.00 26.20 C
ATOM 1979 CG LEU A 233 57.250 -13.505 32.503 1.00 19.39 C
ATOM 1980 CD1 LEU A 233 57.745 -14.850 33.023 1.00 19.90 C
ATOM 1981 CD2 LEU A 233 57.561 -13.287 31.017 1.00 16.01 C
ATOM 1982 N GLU A 234 53.298 -12.343 34.372 1.00 25.45 N
ATOM 1983 CA GLU A 234 51.929 -12.523 34.822 1.00 30.04 C
ATOM 1984 C GLU A 234 51.128 -11.319 34.367 1.00 35.69 C
ATOM 1985 O GLU A 234 49.926 -11.390 34.052 1.00 28.25 O
ATOM 1986 CB GLU A 234 52.007 -12.468 36.344 1.00 37.30 C
ATOM 1987 CG GLU A 234 50.908 -13.133 37.118 1.00 45.39 C
ATOM 1988 CD GLU A 234 51.112 -12.881 38.601 1.00 100.00 C
ATOM 1989 OE1 GLU A 234 52.240 -13.137 39.104 1.00 99.09 O
ATOM 1990 OE2 GLU A 234 50.711 -12.257 39.211 1.00 100.00 O

ATOM 1991 N ASN A 235 51.802 -10.184 34.364 1.00 25.04 N
ATOM 1992 CA ASN A 235 81.10? -8.986 33.992 1.00 26.17 C
ATOM 1993 C ASN A 235 51.280 -8.494 32.571 1.00 30.46 C
ATOM 1994 O ASN A 235 50.824 -7.393 32.259 1.00 22.90 O
ATOM 1995 CB ASN A 235 51.427 -7.895 34.981 1.00 29.23 C
ATOM 1996 CG ASN A 235 50.878 -8.197 36.342 1.00 39.27 C
ATOM 1997 OD1 ASN A 235 49.722 -7.882 36.628 1.00 29.06 O
ATOM 1998 ND2 ASN A 235 51.653 -8.934 37.140 1.00 40.22 N
ATOM 1999 N THR A 236 51.935 -9.268 31.708 1.00 20.97 N
ATOM 2000 CA THR A 236 52.108 -8.795 30.344 1.00 22.30 C
ATOM 2001 C THR A 236 51.867 -9.943 29.419 1.00 29.74 C
ATOM 2002 O THR A 236 51.551 -11.033 29.895 1.00 21.23 O
ATOM 2003 CB THR A 236 53.545 -8.306 30.161 1.00 22.73 C
ATOM 2004 OG1 THR A 236 54.422 -9.325 30.636 1.00 21.2? O
ATOM 2005 CG2 THR A 236 53.801 -7.048 31.041 1.00 19.69 C
ATOM 2006 N GLN A 237 52.003 -9.699 28.109 1.00 22.23 N
ATOM 2007 CA GLN A 237 52.097 -10.783 27.122 1.00 16.69 C
ATOM 2008 C GLN A 237 53.335 -10.507 26.331 1.00 21.02 C
ATOM 2009 O GLN A 237 53.729 -9.362 26.204 1.00 22.19 O
ATOM 2010 CB GLN A 237 50.913 -10.999 26.189 1.00 8.23 C
ATOM 2011 CG GLN A 237 49.639 -11.096 26.904 1.00 21.04 C
ATOM 2012 CD GLN A 237 48.907 -9.862 26.606 1.00 62.07 C
ATOM 2013 OE1 GLN A 237 48.437 -9.712 25.460 1.00 59.32 O
ATOM 2014 NE2 GLN A 237 49.220 -8.847 27.388 1.00 37.82 N
ATOM 2015 N PRO A 238 54.002 -11.579 25.917 1.00 28.76 N
ATOM 2016 CA PRO A 238 55.275 -11.438 25.246 1.00 30.28 C
ATOM 2017 C PRO A 238 55.194 -10.643 23.958 1.00 29.08 C
ATOM 2018 O PRO A 238 56.181 -10.029 23.600 1.00 15.95 O
ATOM 2019 CB PRO A 238 55.733 -12.879 25.011 1.00 22.54 C
ATOM 2020 CG PRO A 238 54.898 -13.710 25.886 1.00 18.92 C
ATOM 2021 CD PRO A 238 53.626 -12.998 26.068 1.00 11.75 C
ATOM 2022 N MET A 239 54.041 -10.635 23.286 1.00 17.26 N
ATOM 2023 CA MET A 239 53.924 -9.807 22.104 1.00 17.85 C
ATOM 2024 C MET A 239 53.109 -8.509 22.362 1.00 18.63 C
ATOM 2025 O MET A 239 52.792 -7.741 21.419 1.00 16.82 O
ATOM 2026 CB MET A 239 53.460 -10.588 20.881 1.00 15.22 C
ATOM 2027 CG MET A 239 54.536 -11.534 20.261 1.00 12.90 C
ATOM 2028 SD MET A 239 53.994 -12.534 18.808 1.00 17.49 S
ATOM 2029 CE MET A 239 54.350 -??.357 ??.422 1.00 13.12 C
ATOM 2030 N LEU A 240 52.847 -8.252 23.646 1.00 18.55 N
ATOM 2031 CA LEU A 240 52.159 -7.037 24.131 1.00 16.68 C
ATOM 2032 C LEU A 240 52.774 -6.733 25.493 1.00 11.82 C
ATOM 2033 O LEU A 240 52.124 -6.803 26.549 1.00 13.84 O
ATOM 2034 CB LEU A 240 50.645 -7.249 24.240 1.00 16.91 C
ATOM 2035 CG LEU A 240 49.646 -6.??? ??.??? 1.00 79.99 C

ATOM 2036 CD1 LEU A 240 48.968 -5.488 2?.033 1.00 25.51 C
ATOM 2037 CD2 LEU A 240 50.070 -5.059 22.815 1.00 28.07 C
ATOM 2038 N SER A 241 54.076 -6.467 25.456 1.00 ?3.09 N
ATOM 2039 CA SER A 241 54.842 -6.315 26.682 1.00 24.20 C
ATOM 2040 C SER A 241 54.947 -4.938 27.377 1.00 30.52 C
ATOM 2041 O SER A 241 55.363 -4.854 28.547 1.00 17.02 O
ATOM 2042 CB SER A 241 56.247 -6.900 26.495 1.00 14.04 C
ATOM 2043 OG SER A 241 57.062 -6.144 25.598 1.00 13.95 O
ATOM 2044 N HIS A 242 54.661 -3.861 26.659 1.00 17.87 N
ATOM 2045 CA HIS A 242 54.894 -2.548 27.221 1.00 13.55 C
ATOM 2046 C HIS A 242 53.990 -2.254 28.373 1.00 13.70 C
ATOM 2047 O HIS A 242 52.974 -2.885 28.539 1.00 13.29 O
ATOM 2048 CB HIS A 242 54.826 -1.430 26.130 1.00 16.05 C
ATOM 2049 CG HIS A 242 53.595 -1.504 25.272 1.00 18.88 C
ATOM 2050 ND1 HIS A 242 52.591 -0.553 25.326 1.00 23.24 N
ATOM 2051 CD2 HIS A 242 53.165 -2.461 24.413 1.00 13.19 C
ATOM 2052 CE1 HIS A 242 51.629 -0.887 24.483 1.00 17.44 C
ATOM 2053 NE2 HIS A 242 51.962 -2.031 23.901 1.00 19.54 N
ATOM 2054 N ILE A 243 54.310 -1.203 29.095 1.00 15.84 N
ATOM 2055 CA ILE A 243 53.492 -0.809 30.192 1.00 19.10 C
ATOM 2056 C ILE A 243 53.336 0.714 30.191 1.00 2?.23 C
ATOM 2057 O ILE A 243 54.312 1.406 30.385 1.00 12.10 O
ATOM 2058 CB ILE A 243 54.166 -1.273 31.482 1.00 24.62 C
ATOM 2059 CG1 ILE A 243 54.014 -2.783 31.576 1.00 25.60 C
ATOM 2060 CG2 ILE A 243 53.497 -0.665 32.735 1.00 17.37 C
ATOM 2061 CD1 ILE A 243 54.725 -3.365 32.714 1.00 14.82 C
ATOM 2062 N ASN A 244 52.112 1.217 30.013 1.00 16.43 N
ATOM 2063 CA ASN A 244 51.824 2.689 30.038 1.00 18.99 C
ATOM 2064 C ASN A 244 52.252 3.292 31.348 1.00 18.83 C
ATOM 2065 O ASN A 244 51.965 2.727 32.405 1.00 19.58 O
ATOM 2066 CB ASN A 244 50.304 2.987 29.910 1.00 15.67 C
ATOM 2067 CG ASN A 244 49.768 2.702 28.517 1.00 14.57 C
ATOM 2068 OD1 ASN A 244 50.546 2.583 27.580 1.00 13.64 O
ATOM 2069 ND2 ASN A 244 48.443 2.491 28.393 1.00 10.16 N
ATOM 2070 N VAL A 245 52.800 4.499 31.326 1.00 13.50 N
ATOM 2071 CA VAL A 245 53.15? 5.134 32.602 1.00 13.49 C
ATOM 2072 C VAL A 245 52.528 6.566 32.644 1.00 16.25 C
ATOM 2073 O VAL A 245 52.786 7.405 31.770 1.00 15.20 O
ATOM 2074 CB VAL A 245 54.754 5.163 32.810 1.00 21.07 C
ATOM 2075 CG1 VAL A 245 55.154 6.085 33.937 1.00 15.00 C
ATOM 2076 CG2 VAL A 245 55.280 3.817 33.143 1.00 15.82 C
ATOM 2077 N GLY A 246 51.696 6.843 33.649 1.00 14.03 N
ATOM 2078 CA GLY A 246 51.027 8.136 33.707 1.00 16.87 C
ATOM 2079 C GLY A 246 50.146 8.203 34.939 1.00 26.95 C
ATOM 2080 O GLY A 246 ??.??? 7.401 35.??? 1.00 ??.?? O

ATOM 2081 N THR A 247 49.207 9.161 34.963 1.00 21.44 N
ATOM 2082 CA THR A 247 48.232 9.276 36.063 1.00 71.39 C
ATOM 2083 C THR A 247 46.868 8.677 35.673 1.00 24.06 C
ATOM 2084 O THR A 247 46.069 ?.306 ??.5?? 1.00 21.03 O
ATOM 2085 CB THR A 247 47.988 10.730 36.404 1.00 22.24 C
ATOM 2086 OG1 THR A 247 47.409 ??.389 35.??? 1.00 1?.62 O
ATOM 2087 CG2 THR A 247 49.275 ??.378 36.724 1.00 1?.?? C
ATOM 2088 N GLY A 248 46.583 8.651 34.384 1.00 24.95 N
ATOM 2089 CA GLY A 248 45.319 8.143 33.924 1.00 27.61 C
ATOM 2090 C GLY A 248 44.22? 9.160 34.226 1.00 21.47 C
ATOM 2091 O GLY A 248 43.05? 8.866 34.137 1.00 25.70 O
ATOM 2092 N VAL A 249 44.615 10.386 34.571 1.00 30.72 N
ATOM 2093 CA VAL A 249 43.673 11.464 34.827 1.00 26.09 C
ATOM 2094 C VAL A 249 43.747 12.596 33.786 1.00 37.70 C
ATOM 2095 O VAL A 249 44.853 13.006 33.387 1.00 76.9? O
ATOM 2096 CB VAL A 249 44.020 12.085 36.214 1.00 38.50 C
ATOM 2097 CG1 VAL A 249 43.225 13.324 36.470 1.00 36.11 C
ATOM 2098 CG2 VAL A 249 43.782 11.083 37.306 1.00 41.30 C
ATOM 2099 N ASP A 250 42.581 13.125 33.307 1.00 77.?? N
ATOM 2100 CA ASP A 250 42.488 14.232 32.439 1.00 20.64 C
ATOM 2101 C ASP A 250 42.611 15.581 33.155 1.00 ??.?? C
ATOM 2102 O ASP A 250 42.188 15.783 34.308 1.00 26.73 O
ATOM 2103 CB ASP A 250 41.075 14.302 31.827 1.00 23.89 C
ATOM 2104 CG ASP A 250 40.768 13.180 30.850 1.00 30.52 C
ATOM 2105 OD1 ASP A 250 41.283 13.184 29.688 1.00 39.96 O
ATOM 2106 OD2 ASP A 250 39.767 12.501 31.153 1.00 45.34 O
ATOM 2107 N CYS A 251 43.029 16.566 32.388 1.00 20.12 N
ATOM 2108 CA CYS A 251 42.962 17.906 32.851 1.00 77.70 C
ATOM 2109 C CYS A 251 42.918 18.779 31.577 1.00 26.47 C
ATOM 2110 O CYS A 251 43.699 18.560 30.633 1.00 19.45 O
ATOM 2111 CB CYS A 251 44.148 18.157 33.778 1.00 34.86 C
ATOM 2112 SG CYS A 251 45.129 19.619 33.453 1.00 29.47 S
ATOM 2113 N THR A 252 41.932 19.673 31.494 1.00 14.85 N
ATOM 2114 CA THR A 252 41.834 20.588 30.335 1.00 21.21 C
ATOM 2115 C THR A 252 42.999 21.592 30.736 1.00 20.53 C
ATOM 2116 O THR A 252 43.657 21.926 31.249 1.00 15.24 O
ATOM 2117 CB THR A 252 40.506 21.407 30.329 1.00 37.08 C
ATOM 2118 OG1 THR A 252 40.460 22.304 31.447 1.00 19.2? O
ATOM 2119 CG2 THR A 252 39.309 20.495 30.372 1.00 13.01 C
ATOM 2120 N ILE A 253 43.228 22.095 29.024 1.00 14.81 N
ATOM 2121 CA ILE A 253 44.264 23.118 28.812 1.00 16.90 C
ATOM 2122 C ILE A 253 43.934 24.383 29.627 1.00 73.41 C
ATOM 2123 O ILE A 253 44.834 25.012 30.247 1.00 15.27 O
ATOM 2124 CB ILE A 253 44.404 23.452 27.302 1.00 24.05 C
ATOM 2125 CG1 ILE A 253 ??.??? 22.200 ??.561 1.00 27.33 C

ATOM 2126 CG2 ILE A 253 ??.??? ??.??? ??.??? 1.00 9.22 C
ATOM 2127 CD1 ILE A 253 45.?62 21.276 27.152 1.00 19.56 C
ATOM 2128 N ARG A 254 4?.?33 21.709 ?9.707 1.00 19.56 N
ATOM 2129 CA ARG A 254 4?.2?8 25.865 30.522 1.00 19.11 C
ATOM 2130 C ARG A 254 12.712 25.713 31.970 1.00 16.10 C
ATOM 2131 O ARG A 254 13.311 26.616 32.515 1.00 13.89 O
ATOM 2132 CB ARG A 254 10.701 26.101 30.160 1.00 15.98 C
ATOM 2133 CG ARG A 254 10.282 27.37? 31.755 1.00 9.96 C
ATOM 2134 CD ARG A 254 38.809 27.702 31.218 1.00 21.79 C
ATOM 2135 NE ARG A 254 38.198 28.111 29.997 1.00 29.12 N
ATOM 2136 CZ ARG A 254 38.693 29.723 29.794 1.00 59.85 C
ATOM 2137 NH1 ARG A 254 39.191 30.527 30.732 1.00 12.56 N
ATOM 2138 NH2 ARG A 254 38.377 30.245 28.620 1.00 18.44 N
ATOM 2139 N ASP A 255 12.106 21.561 32.586 1.00 20.22 N
ATOM 2140 CA ASP A 255 12.795 21.205 33.971 1.00 16.18 C
ATOM 2141 C ASP A 255 11.321 21.372 31.069 1.00 22.13 C
ATOM 2142 O ASP A 255 44.868 24.897 35.060 1.00 18.53 O
ATOM 2143 CB ASP A 255 12.178 22.686 31.157 1.00 19.17 C
ATOM 2144 CG ASP A 255 42.144 22.246 35.610 1.00 47.08 C
ATOM 2145 OD1 ASP A 255 11.780 23.090 36.129 1.00 19.66 O
ATOM 2146 OD2 ASP A 255 42.020 21.016 35.880 1.00 48.12 O
ATOM 2147 N LEU A 256 15.011 23.809 33.078 1.00 15.98 N
ATOM 2148 CA LEU A 256 46.465 23.844 33.069 1.00 21.76 C
ATOM 2149 C LEU A 256 47.020 25.275 33.076 1.00 16.79 C
ATOM 2150 O LEU A 256 47.825 25.697 33.946 1.00 15.24 O
ATOM 2151 CB LEU A 256 46.967 23.056 31.859 1.00 23.33 C
ATOM 2152 CG LEU A 256 18.191 23.100 31.765 1.00 26.60 C
ATOM 2153 CD1 LEU A 256 19.171 22.331 32.981 1.00 17.13 C
ATOM 2154 CD2 LEU A 256 19.010 22.721 30.316 1.00 15.12 C
ATOM 2155 N ALA A 257 16.520 26.018 32.110 1.00 13.77 N
ATOM 2156 CA ALA A 257 46.938 27.436 32.025 1.00 12.70 C
ATOM 2157 C ALA A 257 16.656 28.237 33.267 1.00 10.73 C
ATOM 2158 O ALA A 257 47.451 29.073 33.672 1.00 20.33 O
ATOM 2159 CB ALA A 257 16.208 28.073 30.831 1.00 13.31 C
ATOM 2160 N GLN A 258 45.470 28.080 33.835 1.00 12.40 N
ATOM 2161 CA GLN A 258 15.102 28.911 31.981 1.00 8.39 C
ATOM 2162 C GLN A 258 45.879 28.480 36.166 1.00 13.18 C
ATOM 2163 O GLN A 258 46.179 29.281 37.029 1.00 22.96 O
ATOM 2164 CB GLN A 258 43.614 28.761 35.305 1.00 16.12 C
ATOM 2165 CG GLN A 258 12.671 29.09? 31.130 1.00 30.19 C
ATOM 2166 CD GLN A 258 12.571 30.585 33.781 1.00 37.29 C
ATOM 2167 OE1 GLN A 258 12.911 31.171 31.610 1.00 21.21 O
ATOM 2168 NE2 GLN A 258 12.021 30.87? 32.572 1.00 15.91 N
ATOM 2169 N THR A 259 46.179 27.182 36.232 1.00 16.21 N
ATOM 2170 CA THR A 259 46.98? 26.678 37.336 1.00 16.85 C

ATOM 2171 C THR A 259 48.410 ?7.186 37.233 1.00 20.56 C
ATOM 2172 O THR A 259 49.00? 27.621 38.214 1.00 21.44 O
ATOM 2173 CB THR A 259 47.066 25.192 37.361 1.00 27.56 C
ATOM 2174 OG1 THR A 259 45.752 24.620 37.509 1.00 20.92 O
ATOM 2175 CG2 THR A 259 47.936 24.796 38.545 1.00 12.85 C
ATOM 2176 N ILE A 260 48.952 27.170 36.028 1.00 10.96 N
ATOM 2177 CA ILE A 260 50.292 27.704 35.839 1.00 23.01 C
ATOM 2178 C ILE A 260 50.313 29.180 36.225 1.00 31.73 C
ATOM 2179 O ILE A 260 51.211 29.627 36.993 1.00 ?5.90 O
ATOM 2180 CB ILE A 260 50.835 27.456 34.390 1.00 22.46 C
ATOM 2181 CG1 ILE A 260 51.153 25.940 34.232 1.00 24.12 C
ATOM 2182 CG2 ILE A 260 52.099 28.361 34.106 1.00 13.47 C
ATOM 2183 CD1 ILE A 260 51.501 25.443 32.810 1.00 12.58 C
ATOM 2184 N ALA A 261 49.280 29.910 35.764 1.00 15.35 N
ATOM 2185 CA ALA A 261 49.177 31.355 36.048 1.00 16.00 C
ATOM 2186 C ALA A 261 49.316 31.604 37.550 1.00 20.58 C
ATOM 2187 O ALA A 261 50.104 32.443 37.987 1.00 16.09 O
ATOM 2188 CB ALA A 261 47.832 31.958 35.487 1.00 13.65 C
ATOM 2189 N LYS A 262 48.551 30.843 38.323 1.00 11.50 N
ATOM 2190 CA LYS A 262 48.578 30.905 39.770 1.00 10.13 C
ATOM 2191 C LYS A 262 49.968 30.460 40.296 1.00 28.08 C
ATOM 2192 O LYS A 262 50.503 31.084 41.205 1.00 20.37 O
ATOM 2193 CB LYS A 262 47.453 30.032 40.335 1.00 1?.50 C
ATOM 2194 CG LYS A 262 47.332 29.962 41.888 1.00 16.51 C
ATOM 2195 CD LYS A 262 46.092 29.092 42.371 1.00 46.61 C
ATOM 2196 CE LYS A 262 46.344 27.555 42.661 1.00 99.70 C
ATOM 2197 NZ LYS A 262 45.157 26.703 43.200 1.00 36.59 N
ATOM 2198 N VAL A 263 50.589 29.443 39.705 1.00 17.44 N
ATOM 2199 CA VAL A 263 51.915 29.039 40.171 1.00 18.72 C
ATOM 2200 C VAL A 263 52.997 30.170 39.997 1.00 32.12 C
ATOM 2201 O VAL A 263 53.871 30.412 40.834 1.00 21.18 O
ATOM 2202 CB VAL A 263 52.389 27.709 ??.476 1.00 16.35 C
ATOM 2203 CG1 VAL A 263 53.920 27.518 39.647 1.00 11.83 C
ATOM 2204 CG2 VAL A 263 51.646 26.522 40.093 1.00 14.99 C
ATOM 2205 N VAL A 264 52.913 30.899 38.909 1.00 21.75 N
ATOM 2206 CA VAL A 264 53.917 31.877 38.653 1.00 19.81 C
ATOM 2207 C VAL A 264 53.719 33.208 39.377 1.00 35.79 C
ATOM 2208 O VAL A 264 54.632 34.032 39.482 1.00 28.99 O
ATOM 2209 CB VAL A 264 54.059 32.014 37.175 1.00 24.27 C
ATOM 2210 CG1 VAL A 264 54.728 33.269 36.822 1.00 33.58 C
ATOM 2211 CG2 VAL A 264 54.840 30.808 36.674 1.00 23.01 C
ATOM 2212 N GLY A 265 52.550 33.378 39.969 1.00 25.30 N
ATOM 2213 CA GLY A 265 52.241 34.6?0 40.636 1.00 24.14 C
ATOM 2214 C GLY A 265 51.730 35.694 39.632 1.00 3?.03 C
ATOM 2260 CA VAL A 271 42.666 34.492 27.487 1.00 28.32 C

ATOM 2261 C VAL A 271 42.819 33.370 26.44? 1.00 24.89 C
ATOM 2262 O VAL A 271 43.923 33.115 25.980 1.00 21.98 O
ATOM 2263 CB VAL A 271 42.901 35.813 26.736 1.00 29.25 C
ATOM 2264 CG1 VAL A 271 42.256 35.773 25.370 1.00 31.91 C
ATOM 2265 CG2 VAL A 271 42.421 36.989 27.565 1.00 18.72 C
ATOM 2266 N PHE A 272 41.716 32.758 26.019 1.00 26.14 N
ATOM 2267 CA PHE A 272 41.752 31.747 24.963 1.00 24.34 C
ATOM 2268 C PHE A 272 41.236 32.266 23.623 1.00 28.95 C
ATOM 2269 O PHE A 272 40.155 32.826 23.582 1.00 22.01 O
ATOM 2270 CB PHE A 272 40.960 30.506 25.391 1.00 20.97 C
ATOM 2271 CG PHE A 272 41.764 29.570 26.243 1.00 21.77 C
ATOM 2272 CD1 PHE A 272 41.940 29.842 27.610 1.00 14.60 C
ATOM 2273 CD2 PHE A 272 42.504 28.550 25.656 1.00 22.19 C
ATOM 2274 CE1 PHE A 272 42.763 29.041 28.434 1.00 17.89 C
ATOM 2275 CE2 PHE A 272 43.336 27.726 26.454 1.00 27.64 C
ATOM 2276 CZ PHE A 272 43.478 27.979 27.851 1.00 25.14 C
ATOM 2277 N ASP A 273 42.012 32.114 22.542 1.00 29.45 N
ATOM 2278 CA ASP A 273 41.557 32.536 21.214 1.00 22.33 C
ATOM 2279 C ASP A 273 40.896 31.365 20.493 1.00 25.67 C
ATOM 2280 O ASP A 273 41.539 30.570 19.793 1.00 17.81 O
ATOM 2281 CB ASP A 273 42.672 33.114 20.343 1.00 21.45 C
ATOM 2282 CG ASP A 273 42.131 33.626 18.990 1.00 26.89 C
ATOM 2283 OD1 ASP A 273 40.975 33.249 18.598 1.00 27.76 O
ATOM 2284 OD2 ASP A 273 42.838 34.421 18.327 1.00 30.06 O
ATOM 2285 N ALA A 274 39.589 31.284 20.649 1.00 15.59 N
ATOM 2286 CA ALA A 274 38.932 30.128 20.128 1.00 23.75 C
ATOM 2287 C ALA A 274 38.853 30.168 18.653 1.00 32.30 C
ATOM 2288 O ALA A 274 38.284 29.256 18.029 1.00 29.37 O
ATOM 2289 CB ALA A 274 37.567 29.905 20.777 1.00 18.87 C
ATOM 2290 N SER A 275 39.372 31.243 1?.??? 1.00 21.10 N
ATOM 2291 CA SER A 275 39.343 31.288 16.631 1.00 26.90 C
ATOM 2292 C SER A 275 40.390 30.300 16.1?? 1.00 43.37 C
ATOM 2293 O SER A 275 40.421 29.949 14.927 1.00 46.32 O
ATOM 2294 CB SER A 275 39.547 32.683 16.074 1.00 15.19 C
ATOM 2295 OG SER A 275 40.904 33.070 16.078 1.00 28.71 O
ATOM 2296 N LYS A 276 41.192 29.780 17.037 1.00 22.98 N
ATOM 2297 CA LYS A 276 42.178 28.791 16.638 1.00 23.28 C
ATOM 2298 C LYS A 276 41.645 27.405 16.976 1.00 29.73 C
ATOM 2299 O LYS A 276 40.992 27.206 18.010 1.00 25.10 O
ATOM 2300 CB LYS A 276 43.544 29.051 17.275 1.00 19.19 C
ATOM 2301 CG LYS A 276 43.957 30.496 17.218 1.00 32.11 C
ATOM 2302 CD LYS A 276 44.062 30.852 15.79? 1.00 22.43 C
ATOM 2303 CE LYS A 276 44.930 32.067 15.570 1.00 23.18 C
ATOM 2304 NZ LYS A 276 45.454 32.117 14.152 1.00 29.4? N
ATOM 2305 N PRO A 277 ??.??? ??.??? 15.??? 1.00 36.04 N

ATOM 2306 CA PRO A 277 41.446 25.087 16.170 1.00 35.93 C
ATOM 2307 C PRO A 277 42.022 24.332 17.363 1.00 29.30 C
ATOM 2308 O PRO A 277 43.103 24.650 ??.??? 1.00 30.54 O
ATOM 2309 CB PRO A 277 41.975 24.453 14.878 1.00 39.65 C
ATOM 2310 CG PRO A 277 43.249 25.261 14.566 1.00 42.90 C
ATOM 2311 CD PRO A 277 42.787 26.670 14.892 1.00 37.84 C
ATOM 2312 N ASP A 278 41.273 23.339 17.809 1.00 22.35 N
ATOM 2313 CA ASP A 278 41.745 22.501 1?.903 1.00 22.16 C
ATOM 2314 C ASP A 278 42.184 21.189 18.272 1.00 19.66 C
ATOM 2315 O ASP A 278 41.905 20.917 17.117 1.00 23.49 O
ATOM 2316 CB ASP A 278 40.636 22.241 19.971 1.00 15.09 C
ATOM 2317 CG ASP A 278 40.216 23.503 20.702 1.00 22.86 C
ATOM 2318 OD1 ASP A 278 41.113 24.254 21.096 1.00 25.18 O
ATOM 2319 OD2 ASP A 278 38.999 23.787 20.812 1.00 39.55 O
ATOM 2320 N GLY A 279 42.846 20.355 19.044 1.00 30.65 N
ATOM 2321 CA GLY A 279 43.229 19.034 18.546 1.00 33.78 C
ATOM 2322 C GLY A 279 42.115 18.099 18.944 1.00 38.10 C
ATOM 2323 O GLY A 279 40.963 18.5?? 19.068 1.00 47.52 O
ATOM 2324 N THR A 280 42.419 16.839 19.177 1.00 29.44 N
ATOM 2325 CA THR A 280 41.328 15.990 19.587 1.00 26.68 C
ATOM 2326 C THR A 280 40.889 16.439 20.972 1.00 23.52 C
ATOM 2327 O THR A 280 41.670 17.067 21.713 1.00 23.62 O
ATOM 2328 CB THR A 280 41.695 14.492 19.540 1.00 40.78 C
ATOM 2329 OG1 THR A 280 42.889 14.272 20.296 1.00 25.56 O
ATOM 2330 CG2 THR A 280 41.893 14.054 18.095 1.00 37.71 C
ATOM 2331 N PRO A 281 39.672 16.063 21.346 1.00 25.54 N
ATOM 2332 CA PRO A 281 39.129 16.454 22.628 1.00 25.72 C
ATOM 2333 C PRO A 281 39.776 15.778 23.800 1.00 26.02 C
ATOM 2334 O PRO A 281 39.752 16.314 24.915 1.00 22.68 O
ATOM 2335 CB PRO A 281 37.650 15.??0 22.55? 1.00 28.8? C
ATOM 2336 CG PRO A 281 37.417 15.540 21.201 1.00 29.39 C
ATOM 2337 CD PRO A 281 38.761 15.138 20.646 1.00 26.82 C
ATOM 2338 N ARG A 282 40.281 14.567 23.587 1.00 27.88 N
ATOM 2339 CA ARG A 282 40.806 13.817 24.720 1.00 34.08 C
ATOM 2340 C ARG A 282 41.977 12.918 24.384 1.00 27.62 C
ATOM 2341 O ARG A 282 41.913 12.182 23.425 1.00 23.83 O
ATOM 2342 CB ARG A 282 39.676 13.017 25.405 1.00 20.89 C
ATOM 2343 CG ARG A 282 40.035 12.467 26.775 1.00 22.81 C
ATOM 2344 CD ARG A 282 38.762 11.925 27.442 1.00 26.77 C
ATOM 2345 NE ARG A 282 38.963 11.345 28.781 1.00 36.48 N
ATOM 2346 CZ ARG A 282 38.518 10.139 29.164 1.00 37.74 C
ATOM 2347 NH1 ARG A 282 37.813 9.360 28.346 1.00 28.45 N
ATOM 2348 NH2 ARG A 282 38.754 9.700 30.384 1.00 27.25 N
ATOM 2349 N LYS A 283 43.016 12.963 25.223 1.00 28.91 N
ATOM 2350 CA LYS A 283 ??.??? ??.??? ?5.051 1.00 24.32 C

ATOM 2351 C LYS A 283 44.796 11.766 26.404 1.00 29.57 C
ATOM 2352 O LYS A 283 45.262 12.626 27.138 1.00 33.16 O
ATOM 2353 CB LYS A 283 45.226 13.008 24.287 1.00 21.93 C
ATOM 2354 CG LYS A 283 46.111 12.251 23.316 1.00 32.38 C
ATOM 2355 CD LYS A 283 46.526 13.171 22.143 1.00 95.77 C
ATOM 2356 CE LYS A 283 45.710 12.937 20.836 1.00 100.00 C
ATOM 2357 NZ LYS A 283 46.418 13.332 19.535 1.00 100.00 N
ATOM 2358 N LEU A 284 44.747 10.467 26.734 1.00 23.37 N
ATOM 2359 CA LEU A 284 45.327 9.905 27.997 1.00 16.08 C
ATOM 2360 C LEU A 284 15.463 8.386 28.047 1.00 20.46 C
ATOM 2361 O LEU A 284 44.679 7.655 27.446 1.00 25.45 O
ATOM 2362 CB LEU A 284 44.641 10.387 29.284 1.00 16.30 C
ATOM 2363 CG LEU A 284 13.334 9.700 29.714 1.00 25.97 C
ATOM 2364 CD1 LEU A 284 42.881 10.089 31.152 1.00 22.11 C
ATOM 2365 CD2 LEU A 284 42.203 9.953 28.693 1.00 73.92 C
ATOM 2366 N LEU A 285 46.453 7.939 28.820 1.00 18.51 N
ATOM 2367 CA LEU A 285 46.792 6.527 29.003 1.00 16.77 C
ATOM 2368 C LEU A 285 45.880 5.865 30.006 1.00 30.75 C
ATOM 2369 O LEU A 285 45.576 6.439 31.058 1.00 22.02 O
ATOM 2370 CB LEU A 285 48.229 6.389 29.585 1.00 15.85 C
ATOM 2371 CG LEU A 285 49.307 6.970 28.672 1.00 21.51 C
ATOM 2372 CD1 LEU A 285 50.703 6.705 29.122 1.00 15.15 C
ATOM 2373 CD2 LEU A 285 49.051 6.368 27.330 1.00 16.94 C
ATOM 2374 N ASP A 286 45.565 4.599 29.734 1.00 26.62 N
ATOM 2375 CA ASP A 286 44.945 3.726 30.698 1.00 10.90 C
ATOM 2376 C ASP A 286 46.128 3.055 31.498 1.00 20.54 C
ATOM 2377 O ASP A 286 46.991 2.372 30.938 1.00 23.38 O
ATOM 2378 CB ASP A 286 44.073 2.702 29.970 1.00 14.65 C
ATOM 2379 CG ASP A 286 43.409 1.699 30.943 1.00 24.60 C
ATOM 2380 OD1 ASP A 286 43.932 1.437 32.083 1.00 24.60 O
ATOM 2381 OD2 ASP A 286 42.316 1.231 30.583 1.00 26.03 O
ATOM 2382 N VAL A 287 46.230 3.317 32.791 1.00 15.44 N
ATOM 2383 CA VAL A 287 47.354 2.816 33.556 1.00 15.58 C
ATOM 2384 C VAL A 287 46.973 1.695 34.521 1.00 16.48 C
ATOM 2385 O VAL A 287 47.613 1.473 35.572 1.00 16.63 O
ATOM 2386 CB VAL A 287 48.101 4.006 34.260 1.00 29.84 C
ATOM 2387 CG1 VAL A 287 48.534 5.085 33.224 1.00 18.39 C
ATOM 2388 CG2 VAL A 287 47.173 4.670 35.258 1.00 37.79 C
ATOM 2389 N THR A 288 45.904 0.992 34.152 1.00 22.27 N
ATOM 2390 CA THR A 288 45.428 -0.152 34.956 1.00 19.34 C
ATOM 2391 C THR A 288 46.561 -1.177 35.227 1.00 27.47 C
ATOM 2392 O THR A 288 46.778 -1.586 36.365 1.00 24.87 O
ATOM 2393 CB THR A 288 44.288 -0.909 34.244 1.00 22.86 C
ATOM 2394 OG1 THR A 288 43.120 -0.096 34.106 1.00 24.84 O
ATOM 2395 CG2 THR A 288 43.916 -2.113 3?.024 1.00 25.0? C

ATOM 2396 N ARG A 289 47.290 -1.585 34.179 1.00 26.08 N
ATOM 2397 CA ARG A 289 48.428 -2.506 34.319 1.00 16.92 C
ATOM 2398 C ARG A 289 49.405 -2.037 35.408 1.00 22.96 C
ATOM 2399 O ARG A 289 49.847 -2.790 36.275 1.00 23.03 O
ATOM 2400 CB ARG A 289 49.208 -2.607 32.976 1.00 12.43 C
ATOM 2401 CG ARG A 289 48.934 -3.804 32.103 1.00 29.39 C
ATOM 2402 CD ARG A 289 50.016 -4.102 31.037 1.00 25.88 C
ATOM 2403 NE ARG A 289 49.441 -4.996 30.020 1.00 17.26 N
ATOM 2404 CZ ARG A 289 50.053 -5.459 28.930 1.00 38.82 C
ATOM 2405 NH1 ARG A 289 51.306 -5.153 28.660 1.00 13.51 N
ATOM 2406 NH2 ARG A 289 49.400 -6.262 28.096 1.00 37.68 N
ATOM 2407 N LEU A 290 49.815 -0.786 35.306 1.00 26.60 N
ATOM 2408 CA LEU A 290 50.809 -0.254 36.219 1.00 25.42 C
ATOM 2409 C LEU A 290 50.324 -0.376 37.656 1.00 24.17 C
ATOM 2410 O LEU A 290 51.072 -0.759 38.574 1.00 19.94 O
ATOM 2411 CB LEU A 290 51.000 1.219 35.976 1.00 24.66 C
ATOM 2412 CG LEU A 290 52.281 2.019 36.066 1.00 24.67 C
ATOM 2413 CD1 LEU A 290 51.992 3.479 36.504 1.00 29.25 C
ATOM 2414 CD2 LEU A 290 53.450 1.335 36.788 1.00 15.82 C
ATOM 2415 N HIS A 291 49.093 0.075 37.868 1.00 30.10 N
ATOM 2416 CA HIS A 291 46.513 0.074 39.212 1.00 34.17 C
ATOM 2417 C HIS A 291 49.411 -1.367 39.730 1.00 43.41 C
ATOM 2418 O HIS A 291 48.621 -1.654 40.929 1.00 38.81 O
ATOM 2419 CB HIS A 291 47.113 0.674 39.143 1.00 28.01 C
ATOM 2420 CG HIS A 291 47.097 2.153 30.984 1.00 29.60 C
ATOM 2421 ND1 HIS A 291 48.242 2.921 39.015 1.00 35.63 N
ATOM 2422 CD2 HIS A 291 46.068 3.024 38.855 1.00 31.18 C
ATOM 2423 CE1 HIS A 291 47.926 4.197 38.845 1.00 24.20 C
ATOM 2424 NE2 HIS A 291 46.612 4.289 38.747 1.00 21.92 N
ATOM 2425 N GLN A 292 48.048 -2.260 38.821 1.00 30.71 N
ATOM 2426 CA GLN A 292 47.950 -3.654 39.181 1.00 34.82 C
ATOM 2427 C GLN A 292 49.287 -4.197 39.622 1.00 36.93 C
ATOM 2428 O GLN A 292 49.323 -5.040 40.510 1.00 27.56 O
ATOM 2429 CB GLN A 292 47.322 -4.487 38.069 1.00 28.23 C
ATOM 2430 CG GLN A 292 45.798 -4.405 38.171 1.00 81.15 C
ATOM 2431 CD GLN A 292 45.023 -4.954 36.963 1.00 100.00 C
ATOM 2432 OE1 GLN A 292 45.597 -5.410 35.951 1.00 99.65 O
ATOM 2433 NE2 GLN A 292 43.687 -4.895 37.073 1.00 40.86 N
ATOM 2434 N LEU A 293 50.375 -3.658 39.058 1.00 31.75 N
ATOM 2435 CA LEU A 293 51.750 -4.072 39.383 1.00 22.67 C
ATOM 2436 C LEU A 293 52.238 -3.323 40.613 1.00 28.64 C
ATOM 2437 O LEU A 293 53.420 -3.377 41.017 1.00 22.27 O
ATOM 2438 CB LEU A 293 52.665 -3.769 38.205 1.00 25.57 C
ATOM 2439 CG LEU A 293 52.497 -4.703 37.016 1.00 35.11 C



WO 99/64618 



PCT/US99/11576 






ATOM 2440 CD1 LEU A 293 53.306 -4.170 35.836 1.00 29.25 C
ATOM 2441 CD2 LEU A 293 52.965 -6.110 37.439 1.00 47.81 C
ATOM 2442 N GLY A 294 51.316 -2.510 41.111 1.00 33.08 N
ATOM 2443 CA GLY A 294 51.488 -1.793 42.317 1.00 24.90 C
ATOM 2444 C GLY A 294 52.272 -9.512 42.326 1.00 29.31 C
ATOM 2445 O GLY A 294 53.070 -0.249 13.22? 1.00 25.25 O
ATOM 2446 N TRP A 295 52.000 0.347 41.368 1.00 27.83 N
ATOM 2447 CA TRP A 295 52.687 1.623 41.385 1.00 19.45 C
ATOM 2448 C TRP A 295 51.684 2.731 41.081 1.00 25.79 C
ATOM 2449 O TRP A 295 50.765 2.527 49.297 1.00 20.43 O
ATOM 2450 CB TRP A 295 53.961 1.614 40.524 1.00 12.85 C
ATOM 2451 CG TRP A 295 54.750 2.911 40.618 1.00 23.04 C
ATOM 2452 CD1 TRP A 295 55.897 3.161 41.368 1.00 23.68 C
ATOM 2453 CD2 TRP A 295 54.415 4.159 39.979 1.00 29.72 C
ATOM 2454 NE1 TRP A 295 56.258 4.493 41.241 1.00 18.67 N
ATOM 2455 CE2 TRP A 295 55.389 5.113 40.373 1.00 20.95 C
ATOM 2456 CE3 TRP A 295 53.406 4.550 39.102 1.00 21.47 C
ATOM 2457 CZ2 TRP A 295 55.338 6.439 39.958 1.00 17.58 C
ATOM 2458 CZ3 TRP A 295 53.403 5.973 38.632 1.00 21.57 C
ATOM 2459 CH2 TRP A 295 54.368 6.787 39.058 1.00 19.45 C
ATOM 2460 N TYR A 296 51.709 3.797 41.884 1.00 25.17 N
ATOM 2461 CA TYR A 296 50.720 4.883 41.731 1.00 24.90 C
ATOM 2462 C TYR A 296 51.517 6.178 11.957 1.00 30.85 C
ATOM 2463 O TYR A 296 52.363 ?.272 42.745 1.00 21.27 O
ATOM 2464 CB TYR A 296 49.654 4.813 42.840 1.00 25.18 C
ATOM 2465 CG TYR A 296 48.685 3.651 42.744 1.00 23.04 C
ATOM 2466 CD1 TYR A 296 49.078 2.343 43.088 1.00 31.62 C
ATOM 2467 CD2 TYR A 296 47.389 3.853 42.289 1.00 26.02 C
ATOM 2468 CE1 TYR A 296 48.203 1.268 42.935 1.00 24.42 C
ATOM 2469 CE2 TYR A 296 46.49? 2.770 42.127 1.00 24.81 C
ATOM 2470 CZ TYR A 296 46.902 1.483 42.464 1.00 39.41 C
ATOM 2471 OH TYR A 296 45.994 0.434 42.337 1.00 66.19 O
ATOM 2472 N HIS A 297 51.324 7.123 49.924 1.00 20.95 N
ATOM 2473 CA HIS A 297 52.130 8.34? 40.938 1.00 26.86 C
ATOM 2474 C HIS A 297 51.947 ?.175 42.210 1.00 35.01 C
ATOM 2475 O HIS A 297 50.885 9.132 42.874 1.00 26.92 O
ATOM 2476 CB HIS A 297 51.519 9.192 39.733 1.00 25.77 C
ATOM 2477 CG HIS A 297 50.499 9.842 39.803 1.00 31.16 C
ATOM 2478 ND1 HIS A 297 49.314 9.145 39.633 1.00 34.21 N
ATOM 2479 CD2 HIS A 297 50.135 ??.094 40.167 1.00 25.83 C
ATOM 2480 CE1 HIS A 297 48.290 9.972 39.776 1.00 24.14 C
ATOM 2481 NE2 HIS A 297 18.761 11.164 40.087 1.00 23.35 N
ATOM 2482 N GLU A 298 52.98? 9.926 42.554 1.00 24.98 N
ATOM 2483 CA GLU A 298 52.?5? 10.683 43.79? 1.00 27.65 C
ATOM 2484 C GLU A 298 52.931 12.197 43.741 1.00 36.06 C









ATOM 2485 O GLU A 298 5 ?.'-" 12.792 44 .TIB 1.00 43.61 O
ATOM 2486 CB GLU A 298 54.15? 10.319 44.686 1.00 22.02 C
ATOM 2487 CG GLU A 298 54.004 8.943 45.285 1.00 36.42 C
ATOM 2488 CD GLU A 298 54.99? 8.664 46.406 1.00 100.00 C
ATOM 2489 OE1 GLU A 298 56.223 8.561 46.152 1.00 44.79 O
ATOM 2490 OE2 GLU A 298 54.526 8.470 47.547 1.00 100.00 O
ATOM 2491 N ILE A 299 5?.737 12.800 42.639 1.00 23.49 N
ATOM 2492 CA ILE A 299 53.268 14.244 42.562 1.00 13.25 C
ATOM 2493 C ILE A 299 52.016 14.848 41.906 1.00 27.05 C
ATOM 2494 O ILE A 299 51.681 14.530 40.757 1.00 26.73 O
ATOM 2495 CB ILE A 299 54.586 14.711 41.862 1.00 15.93 C
ATOM 2496 CG1 ILE A 299 55.836 14.183 42.606 1.00 23.83 C
ATOM 2497 CG2 ILE A 299 54.596 16.213 41.541 1.00 17.37 C
ATOM 2498 CD1 ILE A 299 57.232 14.221 41.787 1.00 21.32 C
ATOM 2499 N SER A 300 51.323 15.716 42.648 1.00 18.55 N
ATOM 2500 CA SER A 300 50.177 16.449 42.091 1.00 19.58 C
ATOM 2501 C SER A 300 50.714 17.415 41.042 1.00 17.29 C
ATOM 2502 O SER A 300 51.824 17.941 41.178 1.00 21.06 O
ATOM 2503 CB SER A 300 49.542 17.307 43.181 1.00 16.78 C
ATOM 2504 OG SER A 300 50.548 17.969 43.923 1.00 75.80 O
ATOM 2505 N LEU A 301 49.870 17.755 40.075 1.00 16.13 N
ATOM 2506 CA LEU A 301 50.246 18.675 39.014 1.00 17.70 C
ATOM 2507 C LEU A 301 50.689 19.964 39.646 1.00 20.11 C
ATOM 2508 O LEU A 301 51.714 20.568 39.303 1.00 20.46 O
ATOM 2509 CB LEU A 301 48.990 18.981 38.197 1.00 17.92 C
ATOM 2510 CG LEU A 301 49.182 20.030 37.112 1.00 25.15 C
ATOM 2511 CD1 LEU A 301 50.233 19.552 36.086 1.00 18.82 C
ATOM 2512 CD2 LEU A 301 47.854 20.177 36.436 1.00 25.88 C
ATOM 2513 N GLU A 302 49.845 20.398 40.554 1.00 27.01 N
ATOM 2514 CA GLU A 302 50.053 21.636 41.280 1.00 37.72 C
ATOM 2515 C GLU A 302 51.410 21.618 41.996 1.00 29.99 C
ATOM 2516 O GLU A 302 52.245 22.514 41.798 1.00 27.15 O
ATOM 2517 CB GLU A 302 48.699 21.841 42.275 1.00 43. IP C
ATOM 2518 CG GLU A 302 49.061 23.061 43.174 1.00 90.85 C
ATOM 2519 CD GLU A 302 48.451 24.324 42.580 1.00 100.00 C
ATOM 2520 OE1 GLU A 302 47.566 24.209 41.706 1.00 100.00 O
ATOM 2521 OE2 GLU A 302 48.PP8 25.432 43.036 1.00 64.50 O
ATOM 2522 N ALA A 303 51.646 20.591 42.801 1.00 8.72 N
ATOM 2523 CA ALA A 303 52.937 20.455 43.459 1.00 15.03 C
ATOM 2524 C ALA A 303 54.102 20.355 42.450 1.00 19.85 C
ATOM 2525 O ALA A 303 55.104 21.090 42.553 1.00 22.24 O
ATOM 2526 CB ALA A 303 52.939 19.258 44.410 1.00 18.97 C
ATOM 2527 N GLY A 304 53.953 19.472 41.467 1.00 13.05 N
ATOM 2528 CA GLY A 304 54.970 19.321 40.448 1.00 8.94 C
ATOM 2529 C GLY A 304 55.739 20.621 39.695 1.00 20.31 C
ATOM 2530 O GLY A 304 56.394 20.900 39.3?? 1.00 14.30 O
ATOM 2531 N LEU A 305 54.191 21.383 39.361 1.00 io. 7 g N
ATOM 2532 CA LEU A 305 54.483 22.622 38.611 1.00 20.29 C
ATOM 2533 C LEU A 305 55.281 23.669 39.456 1.00 gft.02 C
ATOM 2534 O LEU A 305 56.194 24.385 38.974 1.00 n.so O
ATOM 2535 CB LEU A 305 53.202 23.245 38.033 1.00 24.03 C
ATOM 2536 CG LEU A 305 52.357 22.647 36.880 1.00 27.66 C
ATOM 2537 CD1 LEU A 305 50.975 23.384 36.789 1.00 13.44 C
ATOM 2538 CD2 LEU A 305 53.079 22.724 35.543 1.00 18.39 C
ATOM 2539 N ALA A 306 54.904 23.757 40.724 1.00 19.94 N
ATOM 2540 CA ALA A 306 55.544 24.660 41.655 1.00 24.79 C
ATOM 2541 C ALA A 306 57.035 24.380 41.743 1.00 27.51 C
ATOM 2542 O ALA A 306 57.852 25.280 41.662 1.00 29.6ft O
ATOM 2543 CB ALA A 306 54.937 24.471 43.002 1.00 17.87 C
ATOM 2544 N SER A 307 57.378 23.137 42.011 1.00 18.46 N
ATOM 2545 CA SER A 307 58.793 22.756 42.162 1.00 16.31 C
ATOM 2546 C SER A 307 59.547 22.885 40.832 1.00 22.66 C
ATOM 2547 O SER A 307 60.742 23.212 40.786 1.00 28.47 O
ATOM 2548 CB SER A 307 58.851 21.304 42.622 1.00 20.47 C
ATOM 2549 OG SER A 307 58.517 20.454 41.526 1.00 29.03 O
ATOM 2550 N THR A 308 58.849 22.631 39.735 1.00 27.31 N
ATOM 2551 CA THR A 308 59.458 22.738 38.413 1.00 22.89 C
ATOM 2552 C THR A 308 59.757 24.216 38.107 1.00 26.06 C
ATOM 2553 O THR A 308 60.819 24.546 37.59! 1.00 29.89 O
ATOM 2554 CB THR A 308 58.536 22.115 37.318 1.00 18.72 C
ATOM 2555 OG1 THR A 308 58.356 20.714 37.545 1.00 20.17 O
ATOM 2556 CG2 THR A 308 59.094 22.330 35.923 1.00 12.37 C
ATOM 2557 N TYR A 309 58.846 25.118 38.453 1.00 28.20 N
ATOM 2558 CA TYR A 309 59.110 26.549 38.241 1.00 31.09 C
ATOM 2559 C TYR A 309 60.383 27.059 39.045 1.00 16.31 C
ATOM 2560 O TYR A 309 61.179 27.858 38.577 1.00 16.91 O
ATOM 2561 CB TYR A 309 57.819 27.373 38.533 1.00 31.19 C
ATOM 2562 CG TYR A 309 57.944 28.895 ".392 1.00 14.57 C
ATOM 2563 CD1 TYR A 309 58.397 29.457 37.224 1.00 17.51 C
ATOM 2564 CD2 TYR A 309 57.575 29.757 39.442 1.00 24.99 C
ATOM 2565 CE1 TYR A 309 58.527 30.801 37.100 1.00 18.41 C
ATOM 2566 CE2 TYR A 309 57.744 31.129 39.351 1.00 19.04 C
ATOM 2567 CZ TYR A 309 58.212 31.641 38.164 1.00 29.13 C
ATOM 2568 OH TYR A 309 58.300 33.004 37.966 1.00 28.22 O
ATOM 2569 N GLN A 310 60.560 26.579 40.260 1.00 15.41 N
ATOM 2570 CA GLN A 310 61.705 26.964 41.087 1.00 22.35 C
ATOM 2571 C GLN A 310 63.001 26.492 40.446 1.00 31.46 C
ATOM 2572 O GLN A 310 64.009 27.191 40.442 1.00 33.42 O
ATOM 2573 CB GLN A 310 61.587 26.335 42.482 1.00 17.67 C
ATOM 2574 CG GLN A 310 62.579 26.921 43.461 1.00 57.5ft C
While various embodiments of the present invention have been described in detail,
it is apparent that modifications and adaptations of those embodiments will occur to those
skilled in the art. However, it is to be expressly understood that such modifications and
adaptations are within the spirit and scope of the present invention, as set forth in the
following claims.
What is claimed:

1. A method for producing ascorbic acid or esters thereof in a microorganism,
comprising culturing a microorganism having a genetic modification to increase the action
of an enzyme selected from the group consisting of hexokinase, glucose phosphate
isomerase, phosphomannose isomerase, phosphomannomutase, GDP-D-mannose
pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose
phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and
L-galactono-γ-lactone dehydrogenase; and recovering said ascorbic acid or esters thereof.

2. A method, as claimed in Claim 1, wherein said genetic modification is a
genetic modification to increase the action of an enzyme selected from the group
consisting of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose
phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and
L-galactono-γ-lactone dehydrogenase.

3. A method, as claimed in Claim 1, wherein said genetic modification is a
genetic modification to increase the action of an epimerase that catalyzes conversion of
GDP-D-mannose to GDP-L-galactose.

4. A method, as claimed in Claim 3, wherein said genetic modification is a
genetic modification to increase the action of GDP-D-mannose:GDP-L-galactose
epimerase.

5. The method of Claim 3, wherein said genetic modification comprises
transformation of said microorganism with a recombinant nucleic acid molecule that
expresses said epimerase.

6. The method of Claim 5, wherein said epimerase has a tertiary structure that
substantially conforms to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data
Bank Accession Code 1bws.

7. The method of Claim 5, wherein said epimerase has a structure having an
average root mean square deviation of less than about 2.5 Å over at least about 25% of
Cα positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank
Accession Code 1bws.
8. The method of Claim 5, wherein said epimerase has a tertiary structure
having an average root mean square deviation of less than about 1 Å over at least about
25% of Cα positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data
Bank Accession Code 1bws.

9. The method of Claim 5, wherein said epimerase comprises a substrate
binding site having a tertiary structure that substantially conforms to the tertiary structure
of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase
represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code
1bws.

10. The method of Claim 9, wherein said substrate binding site has a tertiary
structure with an average root mean square deviation of less than about 2.5 Å over at least
about 25% of Cα positions of the tertiary structure of a substrate binding site of a
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having
Brookhaven Protein Data Bank Accession Code 1bws.

11. The method of Claim 5, wherein said epimerase comprises a catalytic site
having a tertiary structure that substantially conforms to the tertiary structure of the
catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by
atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.

12. The method of Claim 11, wherein said catalytic site has a tertiary structure
with an average root mean square deviation of less than about 2.5 Å over at least about
25% of Cα positions of the tertiary structure of a catalytic site of a
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having
Brookhaven Protein Data Bank Accession Code 1bws.

13. The method of Claim 11, wherein said catalytic site comprises the amino
acid residues serine, tyrosine and lysine.

14. The method of Claim 13, wherein tertiary structure positions of said amino
acid residues serine, tyrosine and lysine substantially conform to tertiary structure
positions of residues Ser107, Tyr136 and Lys140, respectively, as represented by atomic
coordinates in Brookhaven Protein Data Bank Accession Code 1bws.

15. The method of Claim 5, wherein said epimerase binds NADPH.
16. The method of Claim 5, wherein said epimerase comprises an amino acid
sequence that aligns with SEQ ID NO:11 using a CLUSTAL alignment program, wherein
amino acid residues in said amino acid sequence align with 100% identity with at least
about 50% of non-Xaa residues in SEQ ID NO:11.

17. The method of Claim 5, wherein said epimerase comprises an amino acid
sequence that aligns with SEQ ID NO:11 using a CLUSTAL alignment program, wherein
amino acid residues in said amino acid sequence align with 100% identity with at least
about 75% of non-Xaa residues in SEQ ID NO:11.

18. The method of Claim 5, wherein said epimerase comprises an amino acid
sequence that aligns with SEQ ID NO:11 using a CLUSTAL alignment program, wherein
amino acid residues in said amino acid sequence align with 100% identity with at least
about 90% of non-Xaa residues in SEQ ID NO:11.

19. The method of Claim 5, wherein said epimerase comprises an amino acid
sequence having at least 4 contiguous amino acid residues that are 100% identical to at
least 4 contiguous amino acid residues of an amino acid sequence selected from the group
consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID
NO:10.

20. The method of Claim 5, wherein said recombinant nucleic acid molecule
comprises a nucleic acid sequence comprising at least about 12 contiguous nucleotides
having 100% identity with at least about 12 contiguous nucleotides of a nucleic acid
sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:3, SEQ ID
NO:5, SEQ ID NO:7 and SEQ ID NO:9.

21. The method of Claim 5, wherein said epimerase comprises an amino acid
sequence having a motif: Gly-Xaa-Xaa-Gly-Xaa-Xaa-Gly.

22. The method of Claim 5, wherein said recombinant nucleic acid molecule
comprises a nucleic acid sequence that is at least about 15% identical to a nucleic acid
sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:3, SEQ ID
NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method
with Lipman-Pearson standard default parameters.

23. The method of Claim 5, wherein said recombinant nucleic acid molecule
comprises a nucleic acid sequence that is at least about 20% identical to a nucleic acid
sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:3, SEQ ID
NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method
with Lipman-Pearson standard default parameters.

24. The method of Claim 5, wherein said recombinant nucleic acid molecule
comprises a nucleic acid sequence that is at least about 25% identical to a nucleic acid
sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:3, SEQ ID
NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method
with Lipman-Pearson standard default parameters.

25. The method of Claim 5, wherein said recombinant nucleic acid molecule
comprises a nucleic acid sequence that hybridizes under stringent hybridization conditions
to a nucleic acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase.

26. The method of Claim 25, wherein said nucleic acid sequence encoding said
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase is selected from the group
consisting of SEQ ID NO:1, SEQ ID NO:3 and SEQ ID NO:5.

27. The method of Claim 25, wherein said GDP-4-keto-6-deoxy-D-mannose
epimerase/reductase comprises an amino acid sequence selected from the group consisting
of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6.

28. A method, as claimed in Claim 1, wherein said microorganism is selected
from the group consisting of bacteria, fungi and microalgae.

29. A method, as claimed in Claim 1, wherein said microorganism is
acid-tolerant.

30. A method, as claimed in Claim 1, wherein said microorganism is a
bacterium.

31. A method, as claimed in Claim 30, wherein said bacterium is selected from
the group consisting of Azotobacter and Pseudomonas.

32. A method, as claimed in Claim 1, wherein said microorganism is a fungus.

33. A method, as claimed in Claim 32, wherein said microorganism is a yeast.

34. A method, as claimed in Claim 33, wherein said yeast is selected from the
group consisting of Saccharomyces yeast.
35. A method, as claimed in Claim 1, wherein said microorganism is a
microalga.

36. A method, as claimed in Claim 35, wherein said microalga is selected from
the group consisting of microalgae of the genera Prototheca and Chlorella.

37. A method, as claimed in Claim 36, wherein said microalga is selected from
the genus Prototheca.

38. A method, as claimed in Claim 1, wherein said microorganism further
comprises a genetic modification to decrease the action of an enzyme having
GDP-D-mannose as a substrate, other than GDP-D-mannose:GDP-L-galactose epimerase.

39. A method, as claimed in Claim 38, wherein said genetic modification to
decrease the action of an enzyme having GDP-D-mannose as a substrate, other than
GDP-D-mannose:GDP-L-galactose epimerase is a genetic modification to decrease the action
of GDP-D-mannose-dehydrogenase.

40. A method, as claimed in Claim 1, wherein said microorganism is
acid-tolerant and said step of culturing is conducted at a pH of less than about 6.0.

41. A method, as claimed in Claim 1, wherein said microorganism is
acid-tolerant and said step of culturing is conducted at a pH of less than about 5.5.

42. A method, as claimed in Claim 1, wherein said microorganism is
acid-tolerant and said step of culturing is conducted at a pH of less than about 5.0.

43. A method, as claimed in Claim 1, wherein said step of culturing is
conducted in a fermentation medium that is magnesium (Mg) limited.

44. A method, as claimed in Claim 1, wherein said step of culturing is
conducted in a fermentation medium that is Mg limited during a cell growth phase.

45. A method, as claimed in Claim 1, wherein said step of culturing is
conducted in a fermentation medium that comprises less than about 0.5 g/L of Mg during
a cell growth phase.

46. A method, as claimed in Claim 1, wherein said step of culturing is
conducted in a fermentation medium that comprises less than about 0.2 g/L of Mg during
a cell growth phase.
47. A method, as claimed in Claim 1, wherein said step of culturing is
conducted in a fermentation medium that comprises less than about 0.1 g/L of Mg during
a cell growth phase.

48. A method, as claimed in Claim 1, wherein said step of culturing is
conducted in a fermentation medium that comprises a carbon source other than
D-mannose.

49. A method, as claimed in Claim 1, wherein said step of culturing is
conducted in a fermentation medium that comprises glucose as a carbon source.

50. A microorganism for producing ascorbic acid or esters thereof, wherein
said microorganism has a genetic modification to increase the action of an enzyme
selected from the group consisting of hexokinase, glucose phosphate isomerase,
phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase,
GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase,
L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and L-galactono-γ-lactone
dehydrogenase.

51. A microorganism, as claimed in Claim 50, wherein said genetic
modification is a genetic modification to increase the action of an enzyme selected from
the group consisting of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose
phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and
L-galactono-γ-lactone dehydrogenase.

52. A microorganism, as claimed in Claim 50, wherein said genetic
modification is a genetic modification to increase the action of
GDP-D-mannose:GDP-L-galactose epimerase.

53. A microorganism, as claimed in Claim 50, wherein said microorganism has
been genetically modified to express a recombinant nucleic acid molecule encoding an
epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein
said epimerase has a tertiary structure having an average root mean square deviation of
less than about 2.5 Å over at least about 25% of Cα positions of the tertiary structure of
a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic
coordinates having Brookhaven Protein Data Bank Accession Code 1bws.
54. A microorganism, as claimed in Claim 50, wherein said microorganism is
selected from the group consisting of bacteria, fungi and microalgae.

55. A microorganism, as claimed in Claim 50, wherein said microorganism is
a bacterium.

56. A microorganism, as claimed in Claim 55, wherein said bacterium is
selected from the group consisting of Azotobacter and Pseudomonas.

57. A microorganism, as claimed in Claim 50, wherein said microorganism is
a fungus.

58. A microorganism, as claimed in Claim 57, wherein said microorganism is
a yeast.

59. A microorganism, as claimed in Claim 58, wherein said yeast is selected
from the group consisting of Saccharomyces yeast.

60. A plant for producing ascorbic acid or esters thereof wherein said plant
has a genetic modification to increase the action of an enzyme selected from the group
consisting of hexokinase, glucose phosphate isomerase, phosphomannose isomerase,
phosphomannomutase, GDP-D-mannose pyrophosphorylase,
GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase,
L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and L-galactono-γ-lactone
dehydrogenase.

61. A plant, as claimed in Claim 60, wherein said genetic modification is a
genetic modification to increase the action of an enzyme selected from the group
consisting of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose
phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and
L-galactono-γ-lactone dehydrogenase.

62. A plant, as claimed in Claim 60, wherein said genetic modification is a
genetic modification to increase the action of GDP-D-mannose:GDP-L-galactose
epimerase.

63. A plant, as claimed in Claim 60, wherein said plant has been genetically
modified to express a recombinant nucleic acid molecule encoding an epimerase that
catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein said epimerase
has a tertiary structure having an average root mean square deviation of less than about
2.5 Å over at least about 25% of Cα positions of the tertiary structure of a
GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having
Brookhaven Protein Data Bank Accession Code 1bws.

64. A plant, as claimed in Claim 60, wherein said plant further comprises a
genetic modification to decrease the action of an enzyme having GDP-D-mannose as a
substrate other than GDP-D-mannose:GDP-L-galactose epimerase.

65. A plant, as claimed in Claim 60, wherein said genetic modification to
decrease the action of an enzyme having GDP-D-mannose as a substrate other than
GDP-D-mannose:GDP-L-galactose epimerase is a genetic modification to decrease the action
of GDP-D-mannose-dehydrogenase.

66. A plant, as claimed in Claim 60, wherein said plant is a microalga.

67. A plant, as claimed in Claim 66, wherein said plant is selected from the
group consisting of microalgae of the genera Prototheca and Chlorella.

68. A plant, as claimed in Claim 66, wherein said microalga is selected from
the genus Prototheca.

69. A plant, as claimed in Claim 60, wherein said plant is a higher plant.

70. A plant, as claimed in Claim 60, wherein said plant is a consumable higher
plant.

71. A microorganism for producing ascorbic acid or esters thereof wherein
said microorganism has been genetically modified to express a recombinant nucleic acid
molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to
GDP-L-galactose, wherein said epimerase comprises an amino acid sequence that aligns with SEQ
ID NO:11 using a CLUSTAL alignment program, wherein amino acid residues in said
amino acid sequence align with 100% identity with at least about 50% of non-Xaa
residues in SEQ ID NO:11.

72. A plant for producing ascorbic acid or esters thereof, wherein said plant
has been genetically modified to express a recombinant nucleic acid molecule encoding
an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein
said epimerase comprises an amino acid sequence that aligns with SEQ ID NO:11 using
a CLUSTAL alignment program, wherein amino acid residues in said amino acid sequence
align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO:11.
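The identity criterion recited in the claims above (100% identity at a stated percentage of the non-Xaa residues of a consensus sequence) can be illustrated with a short sketch. It is not part of the original disclosure. It assumes the alignment has already been produced by a separate alignment program and is supplied as two equal-length rows in one-letter code, with `X` standing for Xaa and `-` for a gap; the function name and the example rows are hypothetical and are not taken from SEQ ID NO:11.

```python
def identity_over_consensus(consensus_row, candidate_row, unknown="X", gap="-"):
    """Fraction of defined (non-Xaa, non-gap) consensus positions that the
    candidate matches exactly, given two rows of the same alignment."""
    if len(consensus_row) != len(candidate_row):
        raise ValueError("alignment rows must have equal length")
    scored = 0
    matched = 0
    for cons, cand in zip(consensus_row, candidate_row):
        if cons in (unknown, gap):
            continue  # Xaa and gap columns in the consensus are not scored
        scored += 1
        if cand == cons:
            matched += 1
    if scored == 0:
        raise ValueError("consensus row has no defined residues")
    return matched / scored


# Hypothetical aligned rows, for illustration only.
consensus = "GXXGXXGSA-I"
candidate = "GHRGMVGTAKI"

fraction = identity_over_consensus(consensus, candidate)
print(f"{fraction:.3f}", fraction >= 0.50)
```

The thresholds of about 50%, 75% and 90% then reduce to comparing the returned fraction against 0.50, 0.75 and 0.90.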
Burlingame, Richard P.

<120> "VITAMIN C PRODUCTION IN MICROORGANISMS AND PLANTS" 

<130> 3161-24-PCT 
<141> 1999-05-25

<150> 60/125,073 
<151> 1999-03-17 

<150> 60/125,054 
<151> 1999-03-18 

<150> 60/088,549 
<151> 1998-06-08 

<160> 15 

<170> PatentIn Ver. 2.0

<210> 1 
<211> 1583 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (49)..(993)

<400> 1 

tagtctttaa tttcgcagcg tttttataat tgtgcagagg tttcgtcc atg tct gac 57
Met Ser Asp
1

aaa tct gcc aaa atc ttc gtc gcg ggt cat cgt ggt ttg gtt gga tct 105
Lys Ser Ala Lys Ile Phe Val Ala Gly His Arg Gly Leu Val Gly Ser
5 10 15

gcc att gtc cgc aag ctt cag gaa caa ggt ttc acc aat ctc gtt ctt 153
Ala Ile Val Arg Lys Leu Gln Glu Gln Gly Phe Thr Asn Leu Val Leu
20 25 30 35
aaa aca cac gcc gag ctt gat ctc act cgt caa gcc gat gtt gaa tcc 201
Lys Thr His Ala Glu Leu Asp Leu Thr Arg Gln Ala Asp Val Glu Ser
40 45 50

ttc ttt tct caa gag aag cca gtt tat gta atc cta gca gca gct aaa 249
Phe Phe Ser Gln Glu Lys Pro Val Tyr Val Ile Leu Ala Ala Ala Lys
55 60 65

gtt ggt ggt att cac gct aac aac acc tat cct gct gat ttc att ggt 297
Val Gly Gly Ile His Ala Asn Asn Thr Tyr Pro Ala Asp Phe Ile Gly
70 75 80

gtc aat ctc cag att cag acc aat gtg atc cac tct gca tat gag cac 345
Val Asn Leu Gln Ile Gln Thr Asn Val Ile His Ser Ala Tyr Glu His
85 90 95

ggt gtg aag aag ctt ctc ttc ctt gga tca tcc tgc att tac cct aaa 393
Gly Val Lys Lys Leu Leu Phe Leu Gly Ser Ser Cys Ile Tyr Pro Lys
100 105 110 115

ttt gct cct cag cca att cct gag tct gct ttg tta aca gca tcg ctt 441
Phe Ala Pro Gln Pro Ile Pro Glu Ser Ala Leu Leu Thr Ala Ser Leu
120 125 130

gaa cca act aat gag tgg tat gct att gct aag atc gct ggg att aag 489
Glu Pro Thr Asn Glu Trp Tyr Ala Ile Ala Lys Ile Ala Gly Ile Lys
135 140 145

act tgt cag gct tat agg att cag cac gga tgg gat gca atc tct ggc 537
Thr Cys Gln Ala Tyr Arg Ile Gln His Gly Trp Asp Ala Ile Ser Gly
150 155 160

atg cct act aat ctc tat ggt cct aat gac aat ttc cac ccg gag tct 585
Met Pro Thr Asn Leu Tyr Gly Pro Asn Asp Asn Phe His Pro Glu Ser
165 170 175

cat gtg ctt cct gct ctt atg agg agg ttc cac gag gcg aaa gtg aat 633
His Val Leu Pro Ala Leu Met Arg Arg Phe His Glu Ala Lys Val Asn
180 185 190 195

tgg agc gga gga agt tgt ggt gtg ggg tac aag gta gtc ccg ttg gaa 681
Trp Ser Gly Gly Ser Cys Gly Val Gly Tyr Lys Val Val Pro Leu Glu
200 205 210

ggg aag ttc ttg cat gtt gat gat ttg gct gat gct tgt gtt ttc ttg 729
Gly Lys Phe Leu His Val Asp Asp Leu Ala Asp Ala Cys Val Phe Leu
215 220 225
ctg gat cgg ata cag cgg ggg ttg gag cat gtt aac att gga agt ggt 777
Leu Asp Arg Ile Gln Arg Gly Leu Glu His Val Asn Ile Gly Ser Gly
230 235 240

caa gaa gtg act att aga gag ttg gct gag ttg gtg aaa gag gtt gtt 825
Gln Glu Val Thr Ile Arg Glu Leu Ala Glu Leu Val Lys Glu Val Val
245 250 255

ggt ttt gaa ggg aag ctt gga tgg gat tgc act aag cca gat ggc aca 873
Gly Phe Glu Gly Lys Leu Gly Trp Asp Cys Thr Lys Pro Asp Gly Thr
260 265 270 275

ccg agg aaa ctt atg gac agc tca aag ctc gcg tct ttg ggt tgg aca 921
Pro Arg Lys Leu Met Asp Ser Ser Lys Leu Ala Ser Leu Gly Trp Thr
280 285 290

cct aag gtt tct ctt aga gat ggt ctg agc caa act tat gat tgg tat 969
Pro Lys Val Ser Leu Arg Asp Gly Leu Ser Gln Thr Tyr Asp Trp Tyr
295 300 305

ttg aag aat gtt tgc aac cga taa gttaatggtt tctcttctca tatatacaca 1023
Leu Lys Asn Val Cys Asn Arg
310 315

actattgagt ctcaggtaaa tcagcttatc accacattgt gatttaaacc tttctttgag 1083

attcgagaat tgcttttttt tttatcaaaa ttgattcatt tagagataag acttgcttct 1143

ttatacaaca ttgtctgagg aattttaatt ttggatctcc gagtatggtc tattattagc 1203

tctcttctat acaaattatc aaaacagttg taagaagttt caagaaaaac atttgatatc 1263

tcactaattt ggctatcctt gcaagttgca acgctaaaat gacaaataat gaattctcgg 1323

cccaatgggc ttacacaagc cttgttaaag atagcgtgaa caaaacgcgg ctcactagcc 1383

ctaacctgtc tctctttcgc ttaccttctt cttcgtcttc gttggctcag tcacttgact 1443

tcacggcccg ctcaagctct gacacgaaac tcatttcaaa ttaatttaat aaaaccttaa 1503

tcacaaaagg ggcaaaagca atcgccggcg attatgcctt ctcctccggt gccggagacg 1563

gttgtgagcc aacccgttcg 1583



<210> 2 
<211> 314 
<212> PRT

<213> Arabidopsis thaliana
<400> 2

Met Ser Asp Lys Ser Ala Lys Ile Phe Val Ala Gly His Arg Gly Leu
1 5 10 15

Val Gly Ser Ala Ile Val Arg Lys Leu Gln Glu Gln Gly Phe Thr Asn
20 25 30

Leu Val Leu Lys Thr His Ala Glu Leu Asp Leu Thr Arg Gln Ala Asp
35 40 45

Val Glu Ser Phe Phe Ser Gln Glu Lys Pro Val Tyr Val Ile Leu Ala
50 55 60

Ala Ala Lys Val Gly Gly Ile His Ala Asn Asn Thr Tyr Pro Ala Asp
65 70 75 80

Phe Ile Gly Val Asn Leu Gln Ile Gln Thr Asn Val Ile His Ser Ala
85 90 95

Tyr Glu His Gly Val Lys Lys Leu Leu Phe Leu Gly Ser Ser Cys Ile
100 105 110

Tyr Pro Lys Phe Ala Pro Gln Pro Ile Pro Glu Ser Ala Leu Leu Thr
115 120 125

Ala Ser Leu Glu Pro Thr Asn Glu Trp Tyr Ala Ile Ala Lys Ile Ala
130 135 140

Gly Ile Lys Thr Cys Gln Ala Tyr Arg Ile Gln His Gly Trp Asp Ala
145 150 155 160

Ile Ser Gly Met Pro Thr Asn Leu Tyr Gly Pro Asn Asp Asn Phe His
165 170 175

Pro Glu Ser His Val Leu Pro Ala Leu Met Arg Arg Phe His Glu Ala
180 185 190

Lys Val Asn Trp Ser Gly Gly Ser Cys Gly Val Gly Tyr Lys Val Val
195 200 205

Pro Leu Glu Gly Lys Phe Leu His Val Asp Asp Leu Ala Asp Ala Cys
210 215 220

Val Phe Leu Leu Asp Arg Ile Gln Arg Gly Leu Glu His Val Asn Ile
225 230 235 240
Gly Ser Gly Gln Glu Val Thr Ile Arg Glu Leu Ala Glu Leu Val Lys
245 250 255

Glu Val Val Gly Phe Glu Gly Lys Leu Gly Trp Asp Cys Thr Lys Pro
260 265 270

Asp Gly Thr Pro Arg Lys Leu Met Asp Ser Ser Lys Leu Ala Ser Leu
275 280 285

Gly Trp Thr Pro Lys Val Ser Leu Arg Asp Gly Leu Ser Gln Thr Tyr
290 295 300

Asp Trp Tyr Leu Lys Asn Val Cys Asn Arg
305 310
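The relationship between a CDS feature and its translation in this listing can be checked with a short sketch, which is not part of the original disclosure. It assumes the standard genetic code and 1-based inclusive feature coordinates as written in the `<222>` lines; the function names are hypothetical, and the test string is only the first 69 nucleotides of SEQ ID NO:1 (the 48-nucleotide leader plus seven codons).

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codons_in_feature(start, end):
    """Number of codons in a CDS feature with 1-based inclusive coordinates."""
    length = end - start + 1
    if length % 3:
        raise ValueError("feature length is not a multiple of three")
    return length // 3


def translate_cds(dna, start, end):
    """Translate dna[start..end] (1-based, inclusive), stopping at a stop codon.

    Whitespace in `dna` is ignored so text can be pasted from the listing.
    """
    sequence = "".join(dna.split()).upper()
    cds = sequence[start - 1:end]
    protein = []
    for i in range(0, codons_in_feature(start, start + len(cds) - 1) * 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 69 nucleotides of SEQ ID NO:1: 48-nt leader, then seven codons.
excerpt = (
    "tagtctttaa tttcgcagcg tttttataat tgtgcagagg tttcgtcc "
    "atg tct gac aaa tct gcc aaa"
)
print(translate_cds(excerpt, 49, 69))

# CDS (49)..(993) holds 315 codons: 314 residues plus the stop codon,
# consistent with the <211> 314 length given for SEQ ID NO:2.
print(codons_in_feature(49, 993) - 1)
```

The same arithmetic applies to SEQ ID NO:3, whose CDS (1)..(966) holds 322 codons, matching the 321 residues of SEQ ID NO:4 plus a stop codon.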



<210> 3 

<211> 966 

<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)..(966)

<400> 3 

atg agt aaa caa cga gtt ttt att gct ggt cat cgc ggg atg gtc ggt 48
Met Ser Lys Gln Arg Val Phe Ile Ala Gly His Arg Gly Met Val Gly
1 5 10 15

tcc gcc atc agg cgg cag ctc gaa cag cgc ggt gat gtg gaa ctg gta 96
Ser Ala Ile Arg Arg Gln Leu Glu Gln Arg Gly Asp Val Glu Leu Val
20 25 30

tta cgc acc cgc gac gag ctg aac ctg ctg gac agc cgc gcc gtg cat 144
Leu Arg Thr Arg Asp Glu Leu Asn Leu Leu Asp Ser Arg Ala Val His
35 40 45

gat ttc ttt gcc agc gaa cgt att gac cag gtc tat ctg gcg gcg gcg 192
Asp Phe Phe Ala Ser Glu Arg Ile Asp Gln Val Tyr Leu Ala Ala Ala
50 55 60

aaa gtg ggc ggc att gtt gcc aac aac acc tat ccg gcg gat ttc atc 240
Lys Val Gly Gly Ile Val Ala Asn Asn Thr Tyr Pro Ala Asp Phe Ile
65 70 75 80

tac cag aac atg atg att gag agc aac atc att cac gcc gcg cat cag 288
Tyr Gln Asn Met Met Ile Glu Ser Asn Ile Ile His Ala Ala His Gln
85 90 95

aac gac gtg aac aaa ctg ctg ttt ctc gga tcg tcc tgc atc tac ccg 336
Asn Asp Val Asn Lys Leu Leu Phe Leu Gly Ser Ser Cys Ile Tyr Pro
100 105 110

aaa ctg gca aaa cag ccg atg gca gaa agc gag ttg ttg cag ggc acg 384
Lys Leu Ala Lys Gln Pro Met Ala Glu Ser Glu Leu Leu Gln Gly Thr
115 120 125

ctg gag ccg act aac gag cct tat gct att gcc aaa atc gcc ggg atc 432
Leu Glu Pro Thr Asn Glu Pro Tyr Ala Ile Ala Lys Ile Ala Gly Ile
130 135 140

aaa ctg tgc gaa tca tac aac cgc cag tac gga cgc gat tac cgc tca 480
Lys Leu Cys Glu Ser Tyr Asn Arg Gln Tyr Gly Arg Asp Tyr Arg Ser
145 150 155 160

gtc atg ccg acc aac ctg tac ggg cca cac gac aac ttc cac ccg agt 528
Val Met Pro Thr Asn Leu Tyr Gly Pro His Asp Asn Phe His Pro Ser
165 170 175

aat tcg cat gtg atc cca gca ttg ctg cgt cgc ttc cac gag gcg acg 576
Asn Ser His Val Ile Pro Ala Leu Leu Arg Arg Phe His Glu Ala Thr
180 185 190

gca cag aat gcg ccg gac gtg gtg gta tgg ggc agc ggt aca ccg atg 624
Ala Gln Asn Ala Pro Asp Val Val Val Trp Gly Ser Gly Thr Pro Met
195 200 205

cgc gaa ttt ctg cac gtc gat gat atg gcg gcg gcg agc att cat gtc 672
Arg Glu Phe Leu His Val Asp Asp Met Ala Ala Ala Ser Ile His Val
210 215 220

atg gag ctg gcg cat gaa gtc tgg ctg gag aac acc cag ccg atg ttg 720
Met Glu Leu Ala His Glu Val Trp Leu Glu Asn Thr Gln Pro Met Leu
225 230 235 240

tcg cac att aac gtc ggc acg ggc gtt gac tgc act atc cgc gac gtg 768
Ser His Ile Asn Val Gly Thr Gly Val Asp Cys Thr Ile Arg Asp Val
245 250 255

gcg caa acc atc gcc aaa gtg gtg ggt tac aaa ggc cgg gtg gtt ttt 816
Ala Gln Thr Ile Ala Lys Val Val Gly Tyr Lys Gly Arg Val Val Phe
260 265 270

gat gcc agc aaa ccg gat ggc acg ccg cgc aaa ctg ctg gat gtg acg 864
Asp Ala Ser Lys Pro Asp Gly Thr Pro Arg Lys Leu Leu Asp Val Thr
275 280 285

cgc ctg cat cag ctt ggc tgg tat cac gaa atc tca ctg gaa gcg ggg 912
Arg Leu His Gln Leu Gly Trp Tyr His Glu Ile Ser Leu Glu Ala Gly
290 295 300

ctt gcc agc act tac cag tgg ttc ctt gag aat caa gac cgc ttt cgg 960
Leu Ala Ser Thr Tyr Gln Trp Phe Leu Glu Asn Gln Asp Arg Phe Arg
305 310 315 320

ggg taa 966
Gly



<210> 4 
<211> 321 
<212> PRT 

<213> Escherichia coli 
<400> 4 

Met Ser Lys Gln Arg Val Phe Ile Ala Gly His Arg Gly Met Val Gly
1 5 10 15

Ser Ala Ile Arg Arg Gln Leu Glu Gln Arg Gly Asp Val Glu Leu Val
20 25 30

Leu Arg Thr Arg Asp Glu Leu Asn Leu Leu Asp Ser Arg Ala Val His
35 40 45

Asp Phe Phe Ala Ser Glu Arg Ile Asp Gln Val Tyr Leu Ala Ala Ala
50 55 60

Lys Val Gly Gly Ile Val Ala Asn Asn Thr Tyr Pro Ala Asp Phe Ile
65 70 75 80

Tyr Gln Asn Met Met Ile Glu Ser Asn Ile Ile His Ala Ala His Gln
85 90 95

Asn Asp Val Asn Lys Leu Leu Phe Leu Gly Ser Ser Cys Ile Tyr Pro
100 105 110

Lys Leu Ala Lys Gln Pro Met Ala Glu Ser Glu Leu Leu Gln Gly Thr
115 120 125

Leu Glu Pro Thr Asn Glu Pro Tyr Ala Ile Ala Lys Ile Ala Gly Ile
130 135 140
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Lys Leu Cys Glu Ser Tyr Asn Arg Gin Tyr Gly Arg Asp Tyr Arg Ser 
145 150 155 160 

Val Met Pro Thr Asn Leu Tyr Gly Pro His Asp Asn Phe His Pro Ser 
165 170 175 

Asn Ser His Val He Pro Ala Leu Leu Arg Arg Phe His Glu Ala Thr 
180 185 190 

Ala Gin Asn Ala Pro Asp Val Val Val Trp Gly Ser Gly Thr Pro Met 
195 200 205 

Arg Glu Phe Leu His Val Asp Asp Met Ala Ala Ala Ser lie His Val 
210 215 220 

Met Glu Leu Ala His Glu Val Trp Leu Glu Asn Thr Gin Pro Met Leu 
225 230 235 240 

Ser His He Asn Val Gly Thr Gly Val Asp Cys Thr He Arg Asp Val 
245 250 255 

Ala Gin Thr He Ala Lys Val Val Gly Tyr Lys Gly Arg Val Val Phe 
260 265 270 

Asp Ala Ser Lys Pro Asp Gly Thr Pro Arg Lys Leu Leu Asp Val Thr 
275 280 285 

Arg Leu His Gin Leu Gly Trp Tyr His Glu He Ser Leu Glu Ala Gly 
290 295 300 

Leu Ala Ser Thr Tyr Gin Trp Phe Leu Glu Asn Gin Asp Arg Phe Arg 
305 310 315 320 

Gly 



<210> 5 
<211> 1340 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (75)..(1040)

<400> 5 

ctagaattca gcggccgctg aattctagct agaattcagc ggccgctgaa ttctagaacc 60 
caggtgcaac tgac atg ggt gaa ccc cag gga tcc atg cgg att cta gtg 110
Met Gly Glu Pro Gln Gly Ser Met Arg Ile Leu Val
1 5 10

aca ggg ggc tct ggg ctg gta ggc aaa gcc atc cag aag gtg gta gca 158
Thr Gly Gly Ser Gly Leu Val Gly Lys Ala Ile Gln Lys Val Val Ala
15 20 25

gat gga gct gga ctt cct gga gag gac tgg gtg ttt gtc tcc tct aaa 206
Asp Gly Ala Gly Leu Pro Gly Glu Asp Trp Val Phe Val Ser Ser Lys
30 35 40

gac gcc gat ctc acg gat aca gca cag acc cgc gcc ctg ttt gag aag 254
Asp Ala Asp Leu Thr Asp Thr Ala Gln Thr Arg Ala Leu Phe Glu Lys
45 50 55 60

gtc caa ccc aca cac gtc atc cat ctt gct gca atg gtg ggg ggc ctg 302
Val Gln Pro Thr His Val Ile His Leu Ala Ala Met Val Gly Gly Leu
65 70 75

ttc cgg aat atc aaa tac aat ttg gac ttc tgg agg aaa aac gtg cac 350
Phe Arg Asn Ile Lys Tyr Asn Leu Asp Phe Trp Arg Lys Asn Val His
80 85 90

atg aac gac aac gtc ctg cac tcg gcc ttt gag gtg ggg gcc cgc aag 398
Met Asn Asp Asn Val Leu His Ser Ala Phe Glu Val Gly Ala Arg Lys
95 100 105

gtg gtg tcc tgc ctg tcc acc tgt atc ttc cct gac aag acg acc tac 446
Val Val Ser Cys Leu Ser Thr Cys Ile Phe Pro Asp Lys Thr Thr Tyr
110 115 120



ccg ata gat gag acc atg atc cac aat ggg cct ccc cac aac agc aat 494
Pro Ile Asp Glu Thr Met Ile His Asn Gly Pro Pro His Asn Ser Asn
125 130 135 140

ttt ggg tac tcg tat gcc aag agg atg atc gac gtg cag aac agg gcc 542
Phe Gly Tyr Ser Tyr Ala Lys Arg Met Ile Asp Val Gln Asn Arg Ala
145 150 155

tac ttc cag cag tac ggc tgc acc ttc acc gct gtc atc ccc acc aac 590
Tyr Phe Gln Gln Tyr Gly Cys Thr Phe Thr Ala Val Ile Pro Thr Asn
160 165 170

gtt ttc ggg ccc cac gac aac ttc aac atc gag gat ggc cac gtg ctg 638
Val Phe Gly Pro His Asp Asn Phe Asn Ile Glu Asp Gly His Val Leu
175 180 185

cct ggc ctc atc cac aag gtg cac ctg gcc aag agc agc ggc tcg gcc 686
Pro Gly Leu Ile His Lys Val His Leu Ala Lys Ser Ser Gly Ser Ala
190 195 200

ctg acg gtg tgg ggt aca ggg aat ccg cgg agg cag ttc ata tac tcg 734
Leu Thr Val Trp Gly Thr Gly Asn Pro Arg Arg Gln Phe Ile Tyr Ser
205 210 215 220

ctg gac ctg gcc cag ctc ttt atc tgg gtc ctg cgg gag tac aat gaa 782
Leu Asp Leu Ala Gln Leu Phe Ile Trp Val Leu Arg Glu Tyr Asn Glu
225 230 235

gtg gag ccc atc atc ctc tcc gtg ggc gag gaa gat gag gtc tcc atc 830
Val Glu Pro Ile Ile Leu Ser Val Gly Glu Glu Asp Glu Val Ser Ile
240 245 250

aag gag gca gcc gag gcg gtg gtg gag gcc atg gac ttc cat ggg gaa 878
Lys Glu Ala Ala Glu Ala Val Val Glu Ala Met Asp Phe His Gly Glu
255 260 265

gtc acc ttt gat aca acc aag tcg gat ggg cag ttt aag aag aca gcc 926
Val Thr Phe Asp Thr Thr Lys Ser Asp Gly Gln Phe Lys Lys Thr Ala
270 275 280

agt aac agc aag ctg agg acc tac ctg ccc gac ttc cgg ttc aca ccc 974
Ser Asn Ser Lys Leu Arg Thr Tyr Leu Pro Asp Phe Arg Phe Thr Pro
285 290 295 300

ttc aag cag gcg gtg aag gag acc tgt gct tgg ttc act gac aac tac 1022
Phe Lys Gln Ala Val Lys Glu Thr Cys Ala Trp Phe Thr Asp Asn Tyr
305 310 315

gag cag gcc cgg aag tga agctggaaga caggatcagg tgccagcgga 1070
Glu Gln Ala Arg Lys
320

ccatcggctg gcagagccca gcggccacca cccgtcaacc ctgccaggag ctgagggcac 1130

cacccagcaa cctgggcctg cattccatcc gctctgcagc cccaagcatc tttccagtgg 1190

ggcccccatt cacgttggtc ctcagggaaa ccagggtccg gggcaggccc ggcgctttgc 1250

tccccacacc agccccctgc gcgtgtccac tctgatcctg catcccactc cctgggagcc 1310

aataaagtgc attttcacag aaaaaaaaaa 1340

<210> 6 
<211> 321
<212> PRT
<213> Homo sapiens
<400> 6

Met Gly Glu Pro Gln Gly Ser Met Arg Ile Leu Val Thr Gly Gly Ser
1 5 10 15

Gly Leu Val Gly Lys Ala Ile Gln Lys Val Val Ala Asp Gly Ala Gly
20 25 30

Leu Pro Gly Glu Asp Trp Val Phe Val Ser Ser Lys Asp Ala Asp Leu
35 40 45

Thr Asp Thr Ala Gln Thr Arg Ala Leu Phe Glu Lys Val Gln Pro Thr
50 55 60

His Val Ile His Leu Ala Ala Met Val Gly Gly Leu Phe Arg Asn Ile
65 70 75 80

Lys Tyr Asn Leu Asp Phe Trp Arg Lys Asn Val His Met Asn Asp Asn
85 90 95

Val Leu His Ser Ala Phe Glu Val Gly Ala Arg Lys Val Val Ser Cys
100 105 110

Leu Ser Thr Cys Ile Phe Pro Asp Lys Thr Thr Tyr Pro Ile Asp Glu
115 120 125

Thr Met Ile His Asn Gly Pro Pro His Asn Ser Asn Phe Gly Tyr Ser
130 135 140

Tyr Ala Lys Arg Met Ile Asp Val Gln Asn Arg Ala Tyr Phe Gln Gln
145 150 155 160

Tyr Gly Cys Thr Phe Thr Ala Val Ile Pro Thr Asn Val Phe Gly Pro
165 170 175

His Asp Asn Phe Asn Ile Glu Asp Gly His Val Leu Pro Gly Leu Ile
180 185 190

His Lys Val His Leu Ala Lys Ser Ser Gly Ser Ala Leu Thr Val Trp
195 200 205

Gly Thr Gly Asn Pro Arg Arg Gln Phe Ile Tyr Ser Leu Asp Leu Ala
210 215 220

Gln Leu Phe Ile Trp Val Leu Arg Glu Tyr Asn Glu Val Glu Pro Ile
225 230 235 240

Ile Leu Ser Val Gly Glu Glu Asp Glu Val Ser Ile Lys Glu Ala Ala
245 250 255

Glu Ala Val Val Glu Ala Met Asp Phe His Gly Glu Val Thr Phe Asp
260 265 270

Thr Thr Lys Ser Asp Gly Gln Phe Lys Lys Thr Ala Ser Asn Ser Lys
275 280 285

Leu Arg Thr Tyr Leu Pro Asp Phe Arg Phe Thr Pro Phe Lys Gln Ala
290 295 300

Val Lys Glu Thr Cys Ala Trp Phe Thr Asp Asn Tyr Glu Gln Ala Arg
305 310 315 320

Lys



<210> 7 
<211> 1017 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)..(1017)

<400> 7 

atg aga gtt ctg gtt acc ggt ggt agc ggt tac att gga agt cat acc 48
Met Arg Val Leu Val Thr Gly Gly Ser Gly Tyr Ile Gly Ser His Thr
1 5 10 15

tgt gtg caa tta ctg caa aac ggt cat gat gtc atc att ctt gat aac 96
Cys Val Gln Leu Leu Gln Asn Gly His Asp Val Ile Ile Leu Asp Asn
20 25 30

ctc tgt aac agt aag cgc agc gta ctg cct gtt atc gag cgt tta ggc 144
Leu Cys Asn Ser Lys Arg Ser Val Leu Pro Val Ile Glu Arg Leu Gly
35 40 45

ggc aaa cat cca acg ttt gtt gaa ggc gat att cgt aac gaa gcg ttg 192
Gly Lys His Pro Thr Phe Val Glu Gly Asp Ile Arg Asn Glu Ala Leu
50 55 60

atg acc gag atc ctg cac gat cac gct atc gac acc gtg atc cac ttc 240
Met Thr Glu Ile Leu His Asp His Ala Ile Asp Thr Val Ile His Phe
65 70 75 80

gcc ggg ctg aaa gcc gtg ggc gaa tcg gta caa aaa ccg ctg gaa tat 288
Ala Gly Leu Lys Ala Val Gly Glu Ser Val Gln Lys Pro Leu Glu Tyr
85 90 95

tac gac aac aat gtc aac ggc act ctg cgc ctg att agc gcc atg cgc 336
Tyr Asp Asn Asn Val Asn Gly Thr Leu Arg Leu Ile Ser Ala Met Arg
100 105 110

gcc gct aac gtc aaa aac ttt att ttt agc tcc tcc gcc acc gtt tat 384
Ala Ala Asn Val Lys Asn Phe Ile Phe Ser Ser Ser Ala Thr Val Tyr
115 120 125

ggc gat cag ccc aaa att cca tac gtt gaa agc ttc ccg acc ggc aca 432
Gly Asp Gln Pro Lys Ile Pro Tyr Val Glu Ser Phe Pro Thr Gly Thr
130 135 140

ccg caa agc cct tac ggc aaa agc aag ctg atg gtg gaa cag atc ctc 480
Pro Gln Ser Pro Tyr Gly Lys Ser Lys Leu Met Val Glu Gln Ile Leu
145 150 155 160

acc gat ctg caa aaa gcc cag ccg gac tgg agc att gcc ctg ctg cgc 528
Thr Asp Leu Gln Lys Ala Gln Pro Asp Trp Ser Ile Ala Leu Leu Arg
165 170 175

tac ttc aac ccg gtt ggc gcg cat ccg tcg ggc gat atg ggc gaa gat 576
Tyr Phe Asn Pro Val Gly Ala His Pro Ser Gly Asp Met Gly Glu Asp
180 185 190

ccg caa ggc att ccg aat aac ctg atg cca tac atc gcc cag gtt gct 624
Pro Gln Gly Ile Pro Asn Asn Leu Met Pro Tyr Ile Ala Gln Val Ala
195 200 205

gta ggc cgt cgc gac tcg ctg gcg att ttt ggt aac gat tat ccg acc 672
Val Gly Arg Arg Asp Ser Leu Ala Ile Phe Gly Asn Asp Tyr Pro Thr
210 215 220

gaa gat ggt act ggc gta cgc gat tac atc cac gta atg gat ctg gcg 720
Glu Asp Gly Thr Gly Val Arg Asp Tyr Ile His Val Met Asp Leu Ala
225 230 235 240

gac ggt cac gtc gtg gcg atg gaa aaa ctg gcg aac aag cca ggc gta 768
Asp Gly His Val Val Ala Met Glu Lys Leu Ala Asn Lys Pro Gly Val
245 250 255

cac atc tac aac ctc ggc gct ggc gta ggc aac agc gtg ctg gac gtg 816
His Ile Tyr Asn Leu Gly Ala Gly Val Gly Asn Ser Val Leu Asp Val
260 265 270



gtt aat gcc ttc agc aaa gcc tgc ggc aaa ccg gtt aat tat cat ttt 864
Val Asn Ala Phe Ser Lys Ala Cys Gly Lys Pro Val Asn Tyr His Phe
275 280 285

gca ccg cgt cgc gag ggc gac ctt ccg gcc tac tgg gcg gac gcc agc 912
Ala Pro Arg Arg Glu Gly Asp Leu Pro Ala Tyr Trp Ala Asp Ala Ser
290 295 300

aaa gcc gac cgt gaa ctg aac tgg cgc gta acg cgc aca ctc gat gaa 960
Lys Ala Asp Arg Glu Leu Asn Trp Arg Val Thr Arg Thr Leu Asp Glu
305 310 315 320

atg gcg cag gac acc tgg cac tgg cag tca cgc cat cca cag gga tat 1008
Met Ala Gln Asp Thr Trp His Trp Gln Ser Arg His Pro Gln Gly Tyr
325 330 335

ccc gat taa 1017
Pro Asp



<210> 8 
<211> 338 
<212> PRT 

<213> Escherichia coli 
<400> 8 

Met Arg Val Leu Val Thr Gly Gly Ser Gly Tyr Ile Gly Ser His Thr
1 5 10 15

Cys Val Gln Leu Leu Gln Asn Gly His Asp Val Ile Ile Leu Asp Asn
20 25 30

Leu Cys Asn Ser Lys Arg Ser Val Leu Pro Val Ile Glu Arg Leu Gly
35 40 45

Gly Lys His Pro Thr Phe Val Glu Gly Asp Ile Arg Asn Glu Ala Leu
50 55 60

Met Thr Glu Ile Leu His Asp His Ala Ile Asp Thr Val Ile His Phe
65 70 75 80

Ala Gly Leu Lys Ala Val Gly Glu Ser Val Gln Lys Pro Leu Glu Tyr
85 90 95

Tyr Asp Asn Asn Val Asn Gly Thr Leu Arg Leu Ile Ser Ala Met Arg
100 105 110

Ala Ala Asn Val Lys Asn Phe Ile Phe Ser Ser Ser Ala Thr Val Tyr
115 120 125

Gly Asp Gln Pro Lys Ile Pro Tyr Val Glu Ser Phe Pro Thr Gly Thr
130 135 140

Pro Gln Ser Pro Tyr Gly Lys Ser Lys Leu Met Val Glu Gln Ile Leu
145 150 155 160

Thr Asp Leu Gln Lys Ala Gln Pro Asp Trp Ser Ile Ala Leu Leu Arg
165 170 175

Tyr Phe Asn Pro Val Gly Ala His Pro Ser Gly Asp Met Gly Glu Asp
180 185 190

Pro Gln Gly Ile Pro Asn Asn Leu Met Pro Tyr Ile Ala Gln Val Ala
195 200 205

Val Gly Arg Arg Asp Ser Leu Ala Ile Phe Gly Asn Asp Tyr Pro Thr
210 215 220

Glu Asp Gly Thr Gly Val Arg Asp Tyr Ile His Val Met Asp Leu Ala
225 230 235 240

Asp Gly His Val Val Ala Met Glu Lys Leu Ala Asn Lys Pro Gly Val
245 250 255

His Ile Tyr Asn Leu Gly Ala Gly Val Gly Asn Ser Val Leu Asp Val
260 265 270

Val Asn Ala Phe Ser Lys Ala Cys Gly Lys Pro Val Asn Tyr His Phe
275 280 285

Ala Pro Arg Arg Glu Gly Asp Leu Pro Ala Tyr Trp Ala Asp Ala Ser
290 295 300

Lys Ala Asp Arg Glu Leu Asn Trp Arg Val Thr Arg Thr Leu Asp Glu
305 310 315 320

Met Ala Gln Asp Thr Trp His Trp Gln Ser Arg His Pro Gln Gly Tyr
325 330 335 

Pro Asp 



<210> 9
<211> 1047
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1)..(1047)
<400> 9

atg gca gag aag gtg ctg gta aca ggt ggg gct ggc tac att ggc agc 48
Met Ala Glu Lys Val Leu Val Thr Gly Gly Ala Gly Tyr Ile Gly Ser
1 5 10 15

cac acg gtg ctg gag ctg ctg gag gct ggc tac ttg cct gtg gtc atc 96
His Thr Val Leu Glu Leu Leu Glu Ala Gly Tyr Leu Pro Val Val Ile
20 25 30

gat aac ttc cat aat gcc ttc cgt gga ggg ggc tcc ctg cct gag agc 144
Asp Asn Phe His Asn Ala Phe Arg Gly Gly Gly Ser Leu Pro Glu Ser
35 40 45

ctg cgg cgg gtc cag gag ctg aca ggc cgc tct gtg gag ttt gag gag 192
Leu Arg Arg Val Gln Glu Leu Thr Gly Arg Ser Val Glu Phe Glu Glu
50 55 60

atg gac att ttg gac cag gga gcc cta cag cgt ctc ttc aaa aag tac 240
Met Asp Ile Leu Asp Gln Gly Ala Leu Gln Arg Leu Phe Lys Lys Tyr
65 70 75 80

agc ttt atg gcg gtc atc cac ttt gcg ggg ctc aag gcc gtg ggc gag 288
Ser Phe Met Ala Val Ile His Phe Ala Gly Leu Lys Ala Val Gly Glu
85 90 95

tcg gtg cag aag cct ctg gat tat tac aga gtt aac ctg acc ggg acc 336
Ser Val Gln Lys Pro Leu Asp Tyr Tyr Arg Val Asn Leu Thr Gly Thr
100 105 110

atc cag ctt ctg gag atc atg aag gcc cac ggg gtg aag aac ctg gtg 384
Ile Gln Leu Leu Glu Ile Met Lys Ala His Gly Val Lys Asn Leu Val
115 120 125

ttc agc agc tca gcc act gtg tac ggg aac ccc cag tac ctg ccc ctt 432
Phe Ser Ser Ser Ala Thr Val Tyr Gly Asn Pro Gln Tyr Leu Pro Leu
130 135 140

gat gag gcc cac ccc acg ggt ggt tgt acc aac cct tac ggc aag tcc 480
Asp Glu Ala His Pro Thr Gly Gly Cys Thr Asn Pro Tyr Gly Lys Ser
145 150 155 160
aag ttc ttc atc gag gaa atg atc cgg gac ctg tgc cag gca gac aag 528
Lys Phe Phe Ile Glu Glu Met Ile Arg Asp Leu Cys Gln Ala Asp Lys
165 170 175

act tgg aac gta gtg ctg ctg cgc tat ttc aac ccc aca ggt gcc cat 576
Thr Trp Asn Val Val Leu Leu Arg Tyr Phe Asn Pro Thr Gly Ala His
180 185 190

gcc tct ggc tgc att ggt gag gat ccc cag ggc ata ccc aac aac ctc 624
Ala Ser Gly Cys Ile Gly Glu Asp Pro Gln Gly Ile Pro Asn Asn Leu
195 200 205

atg cct tat gtc tcc cag gtg gcg atc ggg cga cgg gag gcc ctg aat 672
Met Pro Tyr Val Ser Gln Val Ala Ile Gly Arg Arg Glu Ala Leu Asn
210 215 220

gtc ttt ggc aat gac tat gac aca gag gat ggc aca ggt gtc cgg gat 720
Val Phe Gly Asn Asp Tyr Asp Thr Glu Asp Gly Thr Gly Val Arg Asp
225 230 235 240

tac atc cat gtc gtg gat ctg gcc aag ggc cac att gca gcc tta agg 768
Tyr Ile His Val Val Asp Leu Ala Lys Gly His Ile Ala Ala Leu Arg
245 250 255

aag ctg aaa gaa cag tgt ggc tgc cgg atc tac aac ctg ggc acg ggc 816
Lys Leu Lys Glu Gln Cys Gly Cys Arg Ile Tyr Asn Leu Gly Thr Gly
260 265 270

aca ggc tat tca gtg ctg cag atg gtc cag gct atg gag aag gcc tct 864
Thr Gly Tyr Ser Val Leu Gln Met Val Gln Ala Met Glu Lys Ala Ser
275 280 285

ggg aag aag atc ccg tac aag gtg gtg gca cgg cgg gaa ggt gat gtg 912
Gly Lys Lys Ile Pro Tyr Lys Val Val Ala Arg Arg Glu Gly Asp Val
290 295 300

gca gcc tgt tac gcc aac ccc agc ctg gcc caa gag gag ctg ggg tgg 960
Ala Ala Cys Tyr Ala Asn Pro Ser Leu Ala Gln Glu Glu Leu Gly Trp
305 310 315 320

aca gca gcc tta ggg ctg gac agg atg tgt gag gat ctc tgg cgc tgg 1008
Thr Ala Ala Leu Gly Leu Asp Arg Met Cys Glu Asp Leu Trp Arg Trp
325 330 335

cag aag cag aat cct tca ggc ttt ggc acg caa gcc tga 1047
Gln Lys Gln Asn Pro Ser Gly Phe Gly Thr Gln Ala
340 345

<210> 10
<211> 348
<212> PRT
<213> Homo sapiens
<400> 10

Met Ala Glu Lys Val Leu Val Thr Gly Gly Ala Gly Tyr Ile Gly Ser
1 5 10 15

His Thr Val Leu Glu Leu Leu Glu Ala Gly Tyr Leu Pro Val Val Ile
20 25 30

Asp Asn Phe His Asn Ala Phe Arg Gly Gly Gly Ser Leu Pro Glu Ser
35 40 45

Leu Arg Arg Val Gln Glu Leu Thr Gly Arg Ser Val Glu Phe Glu Glu
50 55 60

Met Asp Ile Leu Asp Gln Gly Ala Leu Gln Arg Leu Phe Lys Lys Tyr
65 70 75 80

Ser Phe Met Ala Val Ile His Phe Ala Gly Leu Lys Ala Val Gly Glu
85 90 95

Ser Val Gln Lys Pro Leu Asp Tyr Tyr Arg Val Asn Leu Thr Gly Thr
100 105 110

Ile Gln Leu Leu Glu Ile Met Lys Ala His Gly Val Lys Asn Leu Val
115 120 125

Phe Ser Ser Ser Ala Thr Val Tyr Gly Asn Pro Gln Tyr Leu Pro Leu
130 135 140

Asp Glu Ala His Pro Thr Gly Gly Cys Thr Asn Pro Tyr Gly Lys Ser
145 150 155 160

Lys Phe Phe Ile Glu Glu Met Ile Arg Asp Leu Cys Gln Ala Asp Lys
165 170 175

Thr Trp Asn Val Val Leu Leu Arg Tyr Phe Asn Pro Thr Gly Ala His
180 185 190

Ala Ser Gly Cys Ile Gly Glu Asp Pro Gln Gly Ile Pro Asn Asn Leu
195 200 205

Met Pro Tyr Val Ser Gln Val Ala Ile Gly Arg Arg Glu Ala Leu Asn
210 215 220

Val Phe Gly Asn Asp Tyr Asp Thr Glu Asp Gly Thr Gly Val Arg Asp
225 230 235 240

Tyr Ile His Val Val Asp Leu Ala Lys Gly His Ile Ala Ala Leu Arg
245 250 255

Lys Leu Lys Glu Gln Cys Gly Cys Arg Ile Tyr Asn Leu Gly Thr Gly
260 265 270

Thr Gly Tyr Ser Val Leu Gln Met Val Gln Ala Met Glu Lys Ala Ser
275 280 285

Gly Lys Lys Ile Pro Tyr Lys Val Val Ala Arg Arg Glu Gly Asp Val
290 295 300

Ala Ala Cys Tyr Ala Asn Pro Ser Leu Ala Gln Glu Glu Leu Gly Trp
305 310 315 320

Thr Ala Ala Leu Gly Leu Asp Arg Met Cys Glu Asp Leu Trp Arg Trp
325 330 335

Gln Lys Gln Asn Pro Ser Gly Phe Gly Thr Gln Ala
340 345



<210> 11 
<211> 317 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CONSENSUS
<400> 11 

Xaa Xaa Arg Xaa Xaa Xaa Xaa Gly Xaa Xaa Gly Xaa Xaa Gly Xaa Xaa 
1 5 10 15

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Xaa Xaa Xaa Ala Xaa Xaa Xaa 
50 55 60

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80

Asn Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
100 105 110 

Xaa Xaa Xaa Pro Xaa Xaa Glu Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa 
115 120 125 

Xaa Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Lys Xaa Xaa Xaa Xaa Xaa Xaa 
130 135 140 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
145 150 155 160 

Xaa Xaa Asn Xaa Xaa Gly Xaa His Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
165 170 175 

Xaa Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
180 185 190 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Gly Xaa Xaa Xaa Arg Xaa 
195 200 205 

Xaa Xaa Xaa Xaa Xaa Asp Xaa Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
210 215 220 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
225 230 235 240 

Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
245 250 255 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa 
260 265 270 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
275 280 285 

Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
290 295 300 

Xaa Thr Xaa Xaa Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305 310 315 

<210> 12
<211> 34 
<212> DNA 

<213> Escherichia coli 
<400> 12 

tagaattcag taaacaacga gtttttattg ctgg 34 

<210> 13
<211> 32
<212> DNA

<213> Escherichia coli
<400> 13

aactcgagtt acccccaaag cggtcttgat tc 32

<210> 14
<211> 30
<212> DNA

<213> Escherichia coli
<400> 14

ctggagtcga attcatgagt aaacaacgag 30

<210> 15
<211> 33
<212> DNA

<213> Escherichia coli
<400> 15

aactgcagtt acccccgaaa gcggtcttga ttc 33